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5 

SELF-REARRANGING DNA VECTORS 

Background of the Invention 
The invention relates to DNA vectors. 

10 MammaUan cell expression vectors based on DNA viruses have been widely 

discussed as gene delivery vehicles for genetic therapy. Among the different DNA 
viruses proposed for this purpose have been adenoviruses, baculovirus, Epstein Barr 
virus, and herpes simplex virus, hi addition other smaller viruses that have an 
intranuclear phase in which the viral genome is present as a double stranded DNA, such 

15 as retroviruses and parvoviruses, have been proposed as gene delivery vehicles. 

Adenoviral vectors (AdV), for example, have a recognized potential for gene 
delivery, founded in their broad host range, robust growth in culture, and capacity to 
infect mitotically quiescent cells (Graham and Prevec, Manipulation of adenovirus 
vectors, p. 109-128, In E. J. Murray (ed.), Methods in Molecular Biology, vol. 7, 

20 Humana, Clifton, NJ, 1991; Trapnell and GorzigUa, Curr. Opin. Biotechnol. 5:617-625, 
1994). AdV can be propagated in a helper cell line, 293, a human embryonic kidney cell 
line transformed by adenovirus type 5 (Graham et al., J. Gen.Virol. 36:59-72, 1994). 
293 cells express the viral El gene products (El a and Elb) that are the master regulatory 
proteins for subsequent viral gene expression. El deleted viruses can propagate in 293 

25 cells, but not in other cells. Although it would be expected that El deleted viruses lack 
the machinery to express viral genes, several studies have demonstrated that cellular El- 
like components can stimulate viral gene expression (Imperiale et al., Mol. Cell. Biol. 
4:867-74, 1984; Onclercq et al., J. Virol. 62:4533-7,1988; Spergel et al., J. Virol. 
66:1021-30, 1992). The expression of these viral genes results in the relatively rapid 

30 elimination of transduced cells in vivo as a result of cytotoxic T cell responses (Yang et 
al.. Immunity 1:433-42, 1994;. Yang et al.. Gene Ther. 3:137-44, 1996; Yang et al., J. 
Virol. 69:2004-15, 1995). 

Thus attention has been focused on eliminating the remaining vestiges of viral 
expression. Viral genes that have been deleted for this purpose include the gene for E4 

35 proteins (Armentano et al., Hum. Gene Ther. 6:1343-53, 1995; Kochanek et al., Proc. 
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Natl. Acad. Sci. USA 93:5731-6, 1996; and Yeh et al., J. Virol. 70:559-565, 1996), 
DNA binding protein (Engelhardt et al., Proc. Natl. Acad. Sci. USA 21:6196-6200, 
1994; and Gorziglia et al., J. Virol. 70:4173-8, 1996), DNA polymerase (AmaMtano et 
al., J. Virol. 72:926-33, 1998), and the preterminal protein (Schaack et al., Proc. Natl. 
5 Acad. Sci. USA 93: 14686-91, 1996). The most aggressive approach has been the 

creation of helper virus-dependent vectors that lack all viral genes (Hardy et al., J. Virol. 
71:1842-9, 1997; Kochaneket al., Proc. Natl. Acad. Sci. USA 93:5731-6, 1996; Lieber 
et al., J. Virol. 70:8944-60, 1996; Mitani et al., Proc. Natl. Acad. Sci. USA 92:3854-8, 
1995; and Parks et al., Proc. Natl. Acad. Sci. USA 93:13565-13570, 1996). These 
10 vectors have high capacity, evoke reduced cellular immune responses and show 

prolonged expression in vivo (Morsy et al., Proc. Nad. Acad. Sci. USA 95:7866-71, 
1998). However to deploy these viruses on the scale required for human clinical 
application presents major challenges because a cesiimi chloride (CsCl) gradient is 
needed to remove the helper virus. 

15 

Summary of the Invention 
In one aspect, the invention features a replicatable viral DNA vector encoding a 
site-specific DNA-altering enzyme and a DNA target recognized by said enzyme, said 
enzyme selectively converting, in a cell expressing said enzyme, said DNA vector to a 
20 rearranged form. 

In preferred embodiments, the rearranged form includes an autonomously 
replicating episome and a linear DNA product. In other preferred embodiments, the 
vector comprises adenoviral DNA. 

In yet other preferred embodiments, the vector includes a genetically-engineered 
25 recombination site (such as a target of Cre or FLP). Preferably, such a recombination 
site includes a recognition sequence of a site-specific DNA altering enzyme. 

In another preferred embodiment, the site-specific DNA altering enzyme is a 
recombinase (such as Cre or FLP) or an integrase. Preferably, such an enzyme is 
functional in a mammalian cell. Preferred embodiments of the vector also include an 
30 origin of replication that functions in a mammalian cell (such as an Epstein Barr Virus 
replicon). Moreover, the vector typically includes a gene of interest (such as a 
therapeutic gene that encodes a protein or polypeptide or an RNA product). 
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In another aspect, the invention features a method for assembling a recombinant 
adenoviral DNA. The method, in general, includes the steps of: (a) providing a first 
linearized DNA vector comprising a restriction site and a cos site and a second 
linearized DNA vector comprising the restriction site, an adenoviral nucleic acid 
5 molecule, and a cos site; and (b) ligating the first and second linearized DNA vectors, 
the ligation assembling a recombinant adenoviral DNA. 

In preferred embodiments, the first linearized DNA vector comprises a selectable 
marker (such as a gene encoding a polypeptide that confers, on a host cell expressing 
such a polypeptide, resistance to an antibiotic). In other preferred embodiments, the first 
10 linearized DNA vector includes an adenoviral left-end inverted terminal repeat; a gene 
of interest; or both. In still other preferred embodiments, the second linearized DNA 
vector includes a selectable marker. Preferably, the second linearized DNA vector 
includes an adenoviral right-end inverted terminal repeat. 

The method further includes packaging the assembled adenoviral DNA into a 
15 phage and infecting a host cell. Typically the first and second linearized DNAs include 
cosmid vector DNA. In addition, such adenoviral DNA is typically flanked by cleavage 
sites (such as intron endonuclease cleavage sites). 

In another aspect, the invention features an adenovirus producer cell having a 
nucleic acid molecule that expresses a dominant negative site-specific DNA-altering 
20 enzyme. In preferred embodiments, the site-specific DNA altering enzyme is a 

dominant negative recombinase (for example, a Cre recombinase such as CreY324C or a 
Flp recombinase). Exemplary adenovirus producer cells include, without limitation, 293 
human embryonic kidney cells, per.C6 cells, and N52 cells. 

In yet another aspect, the invention features a vector comprising, in the 5' to 3' 
25 direction, a first genetically engineered cis-acting target recognized by a site-specific 
DNA altering enzyme; a gene of interest; a lineage-specific gene promoter; a second 
genetically engineered cis- acting target recognized by a site-specific DNA altering 
enzyme; and a nucleic acid molecule encoding a site-specific DNA altering enzyme. 
In still another aspect, the invention features a vector including, in the 5' to 3' 
30 direction, a first genetically engineered cz^-acting target recognized by a site-specific 

DNA altering enzyme; a gene of interest; a bi-directional promoter, comprising a second 
genetically engineered cz^-acting target recognized by a site-specific DNA altering 
enzyme; and a nucleic acid molecule encoding a site-specific DNA altering enzyme. 
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la related aspects, the invention features a method of gene therapy including the 
administration to a patient in need of gene therapy a therapeutically effective amount of 
the vector of the invention, which is expressed iq the patient. The invention further 
relates to a population of cells transfected with the vector of the invention. 
5 Accordingly, the invention further relates to the use of a recombinant viral vector 

or use of a recombinant viral particle for gene therapy. Such vectors and viral particles 
may be introduced either in vitro into a host cell removed iBrom the patient, or directly in 
vivo, into the body to be treated, according to standard methods known in the art. 
The invention also relates to a pharmaceutical composition that includes a 

10 therapeutically effective amount of a recombinant viral vector or viral particle prepared 
according to the methods disclosed herein, in combination with a vehicle that is 
acceptable from a pharmaceutical standpoint. Such a pharmaceutical composition may 
be prepared according to the techniques commonly employed and administered by any 
known adnainistration route, for example systemically (in particular, by intravenous, 

15 intratracheal, intraperitoneal, intramuscular, subcutaneous, intratumoral, or intracranial 
routes) or by aerosolization or intrapulmonary administration. 

One skilled in the art will appreciate that suitable methods of administering a 
vector (particularly an adenoviral vector) of the present invention to an animal for 
purposes of gene therapy, chemotherapy, and vaccination are available, and, although 

20 more than one route can be used for administration, one particular route may provide a 
more immediate and more effective reaction than another. Pharmaceutically acceptable 
excipients also are well known to those who are skilled in tiie art, and are readily 
available. The choice of excipient will be determined, in part, by the particular method 
used to administer the recombinant vector or particle. Accordingly, there are a wide 

25 variety of suitable formulations for use in the context of the present invention. 

By "recombinant DNA vector" is meant a DNA sequence containing a desired 
sequence (such as a gene of interest) and an appropriate regulatory element(s) necessary 
for the expression of the operably linked sequence in a particular host organism (such as 
a mammal), 

30 By "operably linked" is meant that a gene and a regulatory element(s) are 

connected to permit gene expression when the appropriate molecules (for example, 
transcriptional activator proteins) are bound to the regulatory sequence(s). 
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By "regulatory element" is laeant a genetic element that controls some aspect of 
the expression of a nucleic acid sequence. For example, a promoter is a regulatory 
element that facilitates the initiation of transcription of an operably linked coding region. 
Other genetic regulatory elements include, without limitation, splicing signals, 
5 polyadenylation signals, and termination signals. For example, transcriptional 

regulatory elements in eukaryotes include promoter and enhancer elements. Promoters 
and enhancers include arrays of DNA sequences that interact directly or indirectly with 
cellular proteins involved in transcription. Promoter and enhancer elements have been 
isolated from a variety of eukaryotic sources including genes in mammalian cells and 
10 viruses. 

By "transfection" is meant the introduction of foreign DNA into eukaryotic cells. 
Transfection is typically accomplished by a variety of means known in the art including, 
without limitation, calcium phosphate-DNA co-precipitation, DEAE-dextran-mediated 
transfection, electroporation, microinjection, liposome fusion, lipofection, protoplast 

15 fusion, and biolistics. 

By "stably transfected" is meant the introduction of foreign DNA into the 
genome of the transfected cell. In general, transfer and expression of transgenes in 
mammalian cells are now routine practices to those skilled in the art, and have become 
major tools to carry out gene expression studies and to generate vectors useful in gene 

20 therapy. 

By "gene of interest" is meant a gene inserted into a vector whose expression is 
desired in a host pell. Genes of interest include, without limitation, genes having 
therapeutic value, as well as reporter genes. A variety of such genes are useful in the 
invention, including genes of interest encoding a protein, which provides a therapeutic 

25 function. In addition, the gene of interest, if a therapeutic gene, can render its effect at 
the level of RNA, for instance, by encoding an antisense message or ribozyme, a protein 
which affects splicing or 3' processing (e.g., polyadenylation), or it can encode a protein 
which acts by affecting the level of expression of another gene within the cell (i.e., 
where gene expression is broadly considered to include all steps from initiation of 

30 transcription through production of a processed protein), for example, by mediating an 
altered rate of mRNA accumulation, an alteration of mRNA transport, and/or a change 
in post-transcriptional regulation. 
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By "reporter gene" is meant a gene sequence that encodes a reporter molecule 
(including an enzyme). A "reporter molecule" is detectable in any detection system, 
including, but not limited to, enzyme (e.g., ELISA, as well as enzyme-based 
histochemical assays), fluorescent, radioactive, and luminescent systems. Exemplary 
5 reporter gene systems include the E. coli beta-galactosidase or glucuronidase genes, 
green fluorescent protein (GFP), blue fluorescent protein (BFP), the human placental 
alkaline phosphatase gene, the chloramphenicol acetyltransferase (CAT) gene; other 
reporter genes are known in the art and may be employed as desired. 

By "transgene" is meant any piece of DNA, which is inserted by artifice into a 
10 cell, and becomes part of the genome of the organism, which develops fi:om that cell. 

Such a transgene may include a gene that is partly or entirely heterologous (i.e., foreign) 
to the transgenic organism, or may represent a gene homologous to an endogenous gene 
of the organism. 

By "transgenic" is meant any cell that includes a DNA sequence, which is 
15 inserted by artifice into a cell and becomes part of the genome of the organism, which 
develops from that cell. 

By "polypeptide" is meant any chain of amino acids, regardless of length or post- 
translational modification (for example, glycosylation or phosphorylation). 

By "derived from" is meant isolated fi:om or having the sequence of a naturally 
20 occurring sequence (e.g., a cDNA, genomic DNA, synthetic, or combination thereof). 

By "nucleic acid" is meant a polynucleotide (DNA or RNA). 

By "gene" is meant any nucleic acid sequence coding for a protein or an RNA 
molecule. 

By "gene product" is meant either an untranslated RNA molecule transcribed 
25 from a given gene or coding sequence (such as, mRNA or antisense RNA) or the 

polypeptide chain translated from the mRNA molecule transcribed from the given gene 
or coding sequence. Nucleic acids according to the invention can be wholly or partially 
synthetically made, can comprise genomic or complementary DNA (cDNA) sequences, 
or can be provided in the form of either DNA or RNA. 
30 The presently claimed invention affords a number of advantages. For example, 

applicants' gene therapy vehicles particularly those based on recombinant adenoviruses, 
minimize the propensity of the vectors to activate host immune surveillance, and thereby 
maxinaize the persistence for the DNA transduced. The invention therefore facilitates 
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the development of gene delivery vectors designed to enhance persistence of virally 
delivered genes and evade the cellular immune response by severing the connection 
between the sole adenoviral enhancer and the sequences encoding potentially antigenic 
viral proteins. 

5 As described in more detail below, the mechanism by which this is accomplished 

differs significantly from any other previous approaches. For example, to reduce the 
immunogenicity of vectors it is widely acknowledged that some intervention, such as the 
removal of key genes, or the prevention of their expression iu the cells targeted for 
therapy, is important; however, many related approaches are directed at the host and 

10 have generally focused on the selective induction of tolerance to adenoviral antigens, or 
similar strategies directed at inducing a temporally restricted or antigen-specific 
compromise of the immune system. 

In addition, the poor persistence of transduced DNA appears to be due in part to 
immunological rejection of transduced cells and to the inability of the viral DNA to 

15 replicate, a feature generally inherent in the design of adenoviral vectors, but one which 
is not associated with apphcants' claimed gene therapy vehicles. 

Moreover, some contemporary adenoviral vectors are designed to propagate in 
specific host cells which provide essential replication factors in trans. These vectors are 
typically based on cell lines which express the master regulatory proteins of the El 

20 complex, which are required for induction of adenoviral DNA replication. In cells 

expressing El genes, the best studied of which is a hxmian embryonic kidney cell line 
transformed by DNA from hirnian adenovirus 5 (called HEK293, or simply 293), viruses 
lacking El genes propagate well. Such viruses do not propagate on cell lines which do 
not express El, and do not generally propagate well in the target cells to which the 

25 therapeutic gene is to be delivered. Cells transduced with El -deleted adenovirus vectors 
also do not express high levels of viral genes in the absence of El. However, the weak 
residual expression that remains in such vectors appears to be sufficient to induce 
cellular immune responses that contribute to the destruction of the transduced cells. 

In addition, the gene therapy vectors claimed herein are hybrid vectors capable of 

30 self-rearrangement to form circular and linear DNA products. The linear DNA has a 
compromised ability to express adenoviral genes, and therefore has a lower 
immunological profile. And the circular DNA behaves like a mammalian plasmid, 
encoding the gene of interest and persisting by autonomous replication in the nucleus. 
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For example, the circularization of an adenoviral vector via the action of Cre 
recombinase beneficially places a gene of interest (for example, a therapeutic gene) on a 
self-replicating episome. Vector circularization occurs in a tissue-targeted manner, for 
example, as a result of the activation of a synthetic liver-specific promoter upstream of 
5 the recombinase Cre. Once circularized, the EB V replicon in the episome confers 
improved persistence on the therapeutic gene as detected by reporter gene expression 
and direct assay for the presence of vector DNA sequences. 

Furthermore, the invention eliminates the requirement for a helper virus, thus 
avoiding two potential limitations of that system. First, the continuous expression of Cre 

10 recombinase may lead to toxicity in host cells, either as a direct consequence of the 
protein's activity or via its immunogenicity. Second, the Cre helper virus may itself 
produce antigenic viral proteins that contribute to the immunologic elimination of 
infected host cells. In contrast, the self-resolving adenovirus/EBV vector system 
disclosed herein advantageously provides no alternative source of viral proteins, and Cre 

15 expression is terminated upon rearrangement. 

In addition, the invention described herein provides tools for analyzing the roles 
of the enhancer in viral gene regulation and virus growth. 

The invention also provides a convenient general system for creating 
recombinant adenoviruses, which increase their attractiveness as gene transduction tools 

20 for basic research. The system, for example, employs two conventional plasmid vectors 
and a X phage packaging step. The entire recombinant AdV genome is assembled into a 
single cosmid that is easily amplified in E ,coli. The use of intron endonuclease 
recognition sequences flanking the ITRs enhances virus production while simplifying 
insertion of therapeutic gene sequences into the pLEP shuttle plasmid. The convenience 

25 of this vector system has facilitated the construction of over two hundred recombinant 
viruses to date. 

Other embodiments and advantages of the invention will be apparent from the 
detailed description thereof, and from the claims. 

30 Brief Description of the Drawings 

FIGURE lA is a schematic diagram of the structure of an adenoviral type A 
vector and its fate in a target cell, enh refers to the Ad2 enhancer; GFP refers to the 
marker gene green fluorescent protein; EBV refers to the Epstein Barr Virus replicon; 
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TetOv refers to a heptamer of Tet operator; TetR refers to the Tet repressor; VP16 refers 
to the viral protein 16 of Herpes simplex virus, SD refers to the splice donor site; and SA 
refers to the splice acceptor site. 

FIGURE IB is a schematic diagram of the structure of an adenoviral type B 
5 vector and its fate in a target cell, enh refers to the Ad2 enhancer; GFP refers to the 
marker gene green fluorescent protein; EB V refers to the Epstein Barr Virus replicon; 
SD refers to the splice donor site; and SA refers to the splice acceptor site. 

FIGURE 2A shows a schematic diagram of the pLEP cosmid polylinker region 
and its position relative to the adenwiral left ITR. The adenovirus enhancer/packaging 

10 sequence (\|/) is boxed. 

FIGURE 2B is a schematic diagram showing the generation of a single cosmid 
encoding the AdV genome by the direct ligation of two smaller plasmids. A gene 
expression unit, CMVGFP, was inserted into the pLEP cosmid at the polylinker region. 
pLEP and pREP cosmids were digested with an intron endonuclease (PI-PspI), ligated, 

15 and packaged in vitro to generate pAd2CMVGFP. This DNA was then digested with 
another intron endonuclease (I-Ceul) to expose the ITRs at both ends of the viral 
genome. Finally, cosmid digestion mixtures were transfected into 293 cells. Plaques 
generated by recombinant viruses are detected in 7-10 days. 

FIGURE 3A shows the restriction analysis of cosmids carrying the full length 

20 AdV DNA showing uniform generation of the desired vector DNA. 2 fig DNA samples 
from four pAd2-7CMVGFP colonies were digested with Bgl II, resolved on a 1% 
agarose gel and stained with ethidium bromide. The predicted sizes of the DNA 
fragments are: 13261, 7684, 5228, 5088, 2284, 1757, 1549, 1270, 351, and 275 base 
pairs (bp). The 5228 and 5088 fragments appear as a doublet, and the 351 and 275 bp 

25 fragments are too small to be seen on the gel. 

FIGURE 3B shows the release of the recombinant Ad DNA from cosmids by I- 
Ceul digestion. 2 jiig of pAd2-7CMV DNA from two clones was digested with I-Ceul. 
Arrows indicated the position of the released recombinant AdV DNA and the vector 
fragments of approximately 35 kb and 5 kb, respectively. 

30 FIGURE 4 A shows the appearance of plaques in 293 cells transfected with 10 jug 

of pIAdGFPB with no ITR exposed (undigested), one ITR exposed (BsaBI or I-Ceul), or 
both ITRs exposed (BsaBI plus I-Ceul). Values represent the mean plaque counts per 



-9- 



wo 02/20814 



PCT/USOl/27682 



dish and the time required for plaque development in 293 cells from three separate 
experiments. "I" designates I-Ceul; and "B" designates Bsa BI. 

FIGURE 4B shows the viral titers obtained from plaques that were allowed to 
grow over 10 days after transfection. Viruses were harvested and the titer of each virus 
5 stock was determined by a GFP based semi-quantitative titration procedure described 
herein. Values represent the mean ± SE of three independent determinations. 

FIGURE 5 is a schematic diagram showing a linear AdV that resolves into a 
circular episome. The elements involved in the self-directed rearrangement of the vector 
are shown schematically in pLEPlBHCRGFP/EBV and in the corresponding AdV. 

10 Starting from the left ITR, the elements are shown as following sequence: left ITR, 147 
bp; first 34 bp loxP site; 185 bp enhancer/packaging signal; 64 bp splicing acceptor (SA) 
from EFla gene first intron; 720 bp GFP cDNA; 230 bp SV40 poly(A); 1.7 kb TK- 
EBNA-l/OriP; 970 bp HCR12 promoter; 1 kb EFla gene first intron containing splicing 
donor (SD) and acceptor (S A) sites with the second loxP site inserted at 64 bp upstream 

15 of the 3 'end; 1.2 kb Cre gene tagged with AUl and a nuclear localization signal; '-120bp 
poly (A) signal and PI-PspI site. After infection of liver cells, the HCR12 promoter 
drives the expression of Cre which results in the cleavage of the two loxP sites. This 
results in the circularization of the fragment containing the EBV replicon. The excision 
severs the connection between the enhancer/packaging signals and the remainder of the 

20 AdV genome. The Cre gene becomes promoterless and is left on the AdV genome 
fragment. After excision, the HCR12 promoter drives the expression of the GFP 
reporter gene. The EBV replicon maintains the excised circle as an episome in host 
cells. 

FIGURE 6 A is a schematic representation of the loxP sites and EBNA-1 
25 locations in the AdV genome. The relevant Bgl II site is also shown. 

FIGURE 6B shows the time course of rearrangement in HepG2 and Hela cells at 
an equal multiplicity of infection (moi) of 1,000 particles per cell. Cells were infected 
with Ad2HCRGFP/EBV viruses for 2 hours at 37 ""C. Hirt DNA samples were extracted 
from the cells. -5 fig of Hirt DNA samples were digested with Bgl 11, fractionated on a 

32 

30 1% agarose gel, and analyzed by Southern blot techniques using a P-labeled EBNA-1 
fragment as the hybridization probe. 
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FIGURE 6C shows the DNA blot results obtained from Hela cells infected at a 
moi of 10,000; and HepG2 at 1,000. The upper bands (4915 bp) represent the 
circularized DNA fragments whereas the lower bands (3162 bp) represent the non- 
circularized AdV. 

5 FIGURE 7A shows green fluorescent protein (GFP) expression in liver and non- 

liver cells infected with the Ad2HCRGFP/EBV viruses. Cells were cultured in 35 mm 
dishes and infected with the Ad2HCRGFP/EBV virus at desired moi. HepG2 cells were 
infected with 1,000 particles per cell, whereas Hela cells were infected with a moi of 
10,000. GFP expression was examined at the indicated time points after infection. 

10 Fluorescent cells were photographed using an Olympus SC35n3m camera mounted on an 
Olympus 1X70 fluorescent microscope, at 200x magnification, using a filter with peak 
excitation and emission wavelengths of 450 mn and 510 nm, respectively. 

FIGURE 7B shows the expression of GFP in HepG2, Hela, A431, and HT29 
cells. Cells were seeded in 35 mm dishes and infected with the Ad2HCRGFP/EBV 

15 virus at a moi of 10,000 particles per cell. GFP expression was examined at 72 hours 
after infection. 

FIGURE 7C shows the expression of GFP in human primary hepatocytes. These 
cells were photographed under bright field (left) and fluorescent conditions (right). 
FIGURE 8 A shows the results of RT-PCR that was performed to detect the 

20 tripartite leader sequence (upper panel) for virus late gene expression; and PCR was 
performed in the DNA samples for detection of the AdV genomes. The specific target 
sequences are described in detail infra. PCR analyses of adenovirus late gene 
expression in cells infected with the first generation AdVs or the self-resolving 
Ad2HCRGFP/EBV was analyzed. HepG2 cells were cultured in 35 mm dishes and 

25 infected with increasing moi (0, 10, 100, 1000, 10,000, and 100,000) of adenoviral 
vectors. RNA and DNA were isolated in parallel firom the cells at 72 hours after 
infection. 

FIGURE SB shows a summary of quantitative RT-PCR and PCR results. Each 
determinant was the average of three experiments. 
30 FIGURE 9A is a schematic diagram depicting the deletion analysis of the OriP 

and EBNA-1 regions of the EBV replicon. Structures of the deletions in EBNA-1 and 
OriP are schematically represented. Elements considered important for episomal 
maintenance are indicated. FR refers to the family of repeats; DS designates the region 
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of dyad symmetry; LRl refers to the so-called linker region 1; GA refers to gly-ala 
repeats; LR2 refers to linker region 2; and Dimerization designates the dimerization 
domain. 

FIGURE 9B is a graph depicting fractions of GFP positive cells carrying the 
5 EB V replicons represented in Figure 9A. 

FIGURE lOA shows the positions and identities of Cre mutants tested for their 
dominant negative Cre activities. 

FIGURE 1 OB is a schematic diagram of the substrate Cre plasmid (ad2239) used 
to test dominant negative functions of Cre mutants. 
10 FIGURE IOC shows GFP expression in cells cotransfected with the substrate Cre 

plasmid (ad2239) and the indicated Cre mutants. 

FIGURES lOD and lOE show Cre mutants tested for their ability to inhibit 
rearrangement. Only those showing the strongest inhibitory activities were retested in 
Fig. lOE. GFP intensity was normalized to that of cells in the absence of inhibition. 
15 FIGURE 1 1 shows GFP expression in 293 TetON cells aud #17 cells transfected 

with ad2239. The ability of #17 cells to inhibit Cre activity is demonstrated by the weak • 
GFP signal in cells treated with 2 doxycycliae. 

FIGURE 12 is a schematic diagram depicting the tetracycline mediated auto- 
regulatory circuit. 

20 FIGURES 13A and 13B show the effects of different basal elements on synthetic 

TetO promoter activity. FIGURE 13 A shows a schematic diagram of the components of 
various auto-regulatory synthetic TetO promoters. FIGURE 13B shows a comparison of 
the strength of auto-regulatory synthetic TetO promoters bearing different basal 
elements, in the presence and absence of tetracycline, using GFP as a marker in HepG2 

25 cells. 

FIGURE 14 shows the structure of a Cre substrate plasmid (ad2265). The 
promoter, Ef la, and the gene, BFP, are interrupted by two loxP sites, which can be 
joined by Cre-mediated recombination. PA stands for poly A; BFP for blue fluorescent 
protein. 

30 FIGURES 15A and 15B show the estrogen regulation of Cre recombinase 

activity. 293 cells infected with type B virus, AD 121. 5, in which the Cre enzyme is 
fused with estrogen ligand binding domain at both the N- and C-termini were cultured in 
the presence or absence of 1 |xM estrogen. Cre-mediated rearrangement in the presence 
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of estrogen is shown in Figure 15 A, whereas blot analysis of extrachromosonaal DNA 

from the same cells is shown in Figure 15B. L represents the position corresponding to 

the unrearranged adenoviral DNA; and C represents the position corresponding to the 

circular form of DNA. 
5 FIGURE 16 shows the rearrangement of adenoviral sequences in vivo. 

Extrachromosomal DNA from the livers of Rag-2 mice sacrificed 2.5 hrs post injection 
. of type A adenovirus, AD102.7, was analyzed by DNA blot. L represents the size 

corresponding to linear adenoviral DNA; and C represents the size corresponding to 

rearranged circular DNA. 
10 FIGURE 17 is a photomicrograph depicting high level GFP expression in Rag2 

mouse hepatic tissues 48 hrs post type A adenovirus (AD102.7) injection. 

FIGURE 18A and IBB show schematic diagrams of the structures of adenoviral 

vectors and their fates in target cells, enh refers to Ad2 enhancer; GFP refers to green 

fluorescent protein; EBV refers to Epstein Barr Virus replicon; TetO? refers to heptamer 
15 of Tet operator; TetR refers to Tet repressor; VP16 refers to transcriptional activator 

domain from HSV protein 16; SD refers to splice donor site; and SA refers to splice 

acceptor site. 

FIGURE 19A shows the structure of a FLP substrate plasmid, ad2879. The 
promoter, Efla, and the gene, GFP, are interrupted by 2 FRT sites, which can be joined 
20 by the FLP-mediated recombination. PA stands for poly A; BFP for blue fluorescent 
protein. 

FIGURE 19B shows the structure of a ere substrate plasmid, ad2204. 

FIGURE 20 shows the structures of several FLPe anti-sense plasmids. 

FIGURE 21 is a panel of photomicrographs showing inhibition of FLP enzyme 
25 activity by anti-sense FLP. 293 cells were transfected with FLP substrate (Figure 12) 

and plasmids indicated in each photo. High GFP intensity indicate the higher expression 
of FLP and less inhibition by the anti-sense expressed. 

FIGURE 22 shows a schematic diagram of FRT/Cre and loxP/FLP adenovirus. 

30 
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Detailed Description 
Described herein are systems for the regulated self -rearrangement of DNA 
vectors, for example, gene therapy vectors. Such regulated self-rearrangement has the 
potential to prevent unwanted expression of vector genes not required for a therapeutic 
5 effect, and to allow the stable association of the therapeutic gene with the target cell. 

The essential elements of the regulated DNA rearrangement system are a gene 
which encodes one or more proteins which induce DNA rearrangement, a method for 
regulating the activity of those proteins or their abundance, and a target DNA sequence 
on which those proteins act. Particularly desirable are methods for regulating the 

10 activity of the proteins or their abundance which can be easily carried out on an intact 
organism, such as administration or withdrawal of a drug, hormone, or environmental 
stimulus such as heat or irradiation, which induces the activity or abundance of the 
proteins which cause DNA rearrangement. 

Especially desirable are regulated DNA rearrangement systems in which all of 

15 the components can be delivered in a single vector. An example of this is a virus which 
bears both the cis-acting sequences for DNA rearrangement as well as the protein or 
proteins which act on those sequences, and the regulatory apparatus which controls the 
activity or abundance of those proteins. However, it is not necessary that the different 
elements be encoded in a single nucleic acid. 

20 The important elements of this strategy are: the compromise of vector gene 

function by regulated rearrangement of DNA topology, the generation of plasmid circles 
from vector DNA in a regulated manner, and the removal of enhancer or promoter 
elements from the vector DNA by regulated excision. It is also important that the 
circular DNA generated by site-specific recombination possesses a mechanism for stable 

25 association with the host genome in some form, here conferred by the EBV replicon. In 
other embodiments, the circular DNA might possess the ability to direct its integration 
into the host chromosomes by a site-specific integration. Site-specific integration into 
the host chromosomes may also be generated by the action of a regulated site-specific 
recombinase on a linear template without passing through a circular intermediate. 

30 Also described herein is one particular self-rearranging vector that begins as a 

hybrid adenovirus vector which is capable of converting itself into two unlinked 
molecules, a circular and a linear DNA. After this event the linear DNA product is 
deleted for two important cis-acting sequences: the packaging signals, which are 
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required for insertion of the viral DNA into the viral capsid, and the enhancer, v^^hich 
increases the expression of other promoters encoded in the viral DNA. The remaining 
linear DNA is thereby compromised in its ability to express adenoviral genes, endowing 
the vector with a lower immunological profile. The circular DNA generated by the 
5 excision event is a mammalian cell plasmid which has the capacity to persist by 

autonomous replication in the nucleus. This capacity is encoded in genetic elements 
derived from the Epstein Barr virus (EBV). A schematic diagram of such a vector is 
illustrated in Figure 5 . 

Epstein Barr virus is a human herpes virus which is the etiologic agent of 

10 infectious mononucleosis and which has been implicated in the genesis of Burkitt's 

lymphoma, a B cell neoplasm, and is thought to be a predisposing factor for some forms 
of nasopharyngeal carcinoma. Approximately 85% of the adult Western population has 
a persistent population of B cells which contain a circular latent form of the viral- 
genome, maintained in cells by the action of Epstein Barr Nuclear Antigen 1 (EBNAl), 

15 a DNA replication protein that acts on the viral latent phase origin of replication, OriP. 
EBNAl in and of itself is not thought to promote neoplasia; current thinking places 
greater weight on the actions of the EBNA2 proteins and LMP, latent membrane protein, 
in the inception of EBV-associated neoplasm. 

Mammalian cell plasmids have been created which bear the EBNAl gene and 

20 OriP. In nonrodent cells, these plasmids persist by replication with each transit of the 
cell cycle. Multiple transcription units can be borne by these plasmids, allowing 
regulated expression of diverse gene products. 

Preferred adenoviral vectors, shown in Figures 1 A and IB, are linear forms of an 
EBV plasmid flanked by loxP sites, cis-acting sequences required for site-specific 

25 recombination directed by the bacteriophage PI ere protein. To prepare an adenovirus 
bearing both the ere protein and loxP sites, it is necessary to insure that the ere protein is 
not expressed while the vector is being propagated in 293 cells. To lower the 
immunological profile of the vector, it is also desirable that the ere protein not be 
expressed after the vector delivered its payload to the target cell and die ere protein 

30 performed its function. 

To accomplish these objectives, two general approaches have been developed for 
the production of adenoviral chromosomes that circularize following the regulated 
expression of site-specific recombinases. In each case, the vector is engineered to allow 
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for the production of viruses in 293 cells, and to provide transitory expression of 
recombinase that induces rearrangement in target tissues. The major difference between 
the two strategies lies in the way the deinduction of recombinase is achieved. 

In the first approach, adenoviral vectors are engineered to turn an activating 
5 transcription factor into a repressor upon chromosomal rearrangement. Vectors 

employing this approach are referred to herein as type A vectors (Figure lA). In the 
second approach, the recombinase promoter is redirected following chromosomal 
rearrangement. Vectors utilizing the second approach are referred to as type B vectors 
(Figure IB). In both cases a linear chromosome is converted to its circular episomal 

10 form and a resulting deleted linear form. The circular DNA contains an Epstein Barr 

virus (EBV) replicon, which allows synchronous replication of the episome with the host 
mitotic cycle (Reisman et al., Mol. Cell Biol. 8: 1822-32, 1985; Yates et al.. Nature 313: 
812-15, 1985). The linear DNA is deleted for the enhancer and El genes. 

One self-regulated gene switch, employing the type A vector strategy, was 

15 designed based on the bacterial transposon TnlO tetracycline repressor (tetR) gene. In 
its natural context, the tetR protein binds to specific sequences (tet operator sequences) 
upstream of a tetracycline resistance gene, preventing transcription of the gene unless 
tetracycline is present. To adapt this protein for eukaryotic gene regulation, a gene 
fusion is created between tetR and an active portion of a strong eukaryotic 

20 transcriptional activator, the herpes simplex virus VP16 protein. The fusion protein 

exerts its action on a synthetic promoter created by the insertion of multiple tet operator 
sequences upstream of a basal promoter element. This configuration allows high-level 
gene expression whenever the tetR-VP16 fusion protein binds to its cognate operator 
sequences. Because the tetR protein normally does not bind to its operator in the 

25 presence of tetracycline, the activity of this synthetic promoter is high in the absence of 
tetracycline and low in its presence. 

One example of a type A vector is shown in Figure 1 A. This self-regulated gene 
expression cassette, present in a hybrid adenovirus, consists of a bi-directional promoter 
element in which central tetR binding sites are flanked by divergently oriented basal 

30 promoter elements. In one direction the promoter directs the formation of a transcript 
encoding the ere protein; in the other direction, the promoter directs the formation of a 
tetR~VP16 fusion protein. The latter differs from the conventional version in bearing a 
loxP site between the tetR component and the VP 16 component. When tetracycline is 
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present this gene switch is silent. As shown in Figure 1 A, upon introduction into a 
target cell in the absence of tetracycline, the tetR-loxP-VP16 fusion protein is produced, 
stimulating further production of the fusion protein, and the ere protein. The ere protein 
then acts to promote site specific recombination between the loxP site in the tetR-loxP- 
5 VP16 coding sequences, and a distant loxP site. As a result of this recombination, the 
fusion protein coding sequence is disrupted so that the promoter no longer directs the 
formation of a tetR-loxP-VP16 fusion protein, but gives rise to an inert tetR-loxP-VP16 
fusion protein for binding to the promoter upstream elements, thereby extinguishing 
promoter activity. 

10 As shown in Figure lA, the excised circular DNA element contains at least two 

transcription units, hi addition, other transcription units or internal ribosome entry site 
elements may be used to allow the coexpression of gene products which are useful for 
extending the persistence of the delivered DNA, regulating expression of the gene of 
interest, or providing for ablation of the transduced cells once their presence is no longer 

15 desirable. In addition, the linear DNA remaining after excision of the circular gene 
expression plasmid lacks both viral packaging sequences and the cis-acting enhancer. 
Within this linear DNA, additional loxP sites may be placed to provide for the 
rearrangement of the remaining vector DNA in the target cell, disrupting the normal 
topology of the genes, and further thwarting expression. 

20 Using the type B vector design strategy, described in greater detail below, a 

recombinant adenoviral gene delivery system that is capable of undergoing growth 
phase-dependent site-specific recombination has also been constructed^ 

The following examples are presented for the purpose of illustrating, not 
limiting, the invention. 

25 

TYPE B VECTORS - EXPERIMENTAL RESULTS 
Several experimental examples for constructing type B vectors and for carrying 
out the general approaches of the invention are now described below. 



30 Two-Cosmid System for Efficient Construction of Recombinant AdV 

To simplify and facilitate the generation of recombinant AdV, a system was 
established to assemble the desired AdV genome in a single plasmid by ligation (shown 
in Figures 2A, 2B). The system consists of two component vectors, a left end plasmid, 
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pLEP, and a right end plasmid, pREP. The left end Ad sequences (nt 1-376) in pLEP 
include the viral inverted terminal repeat, the cis-acting packagmg sequences, and the 
viral enhancer. Nucleotide (nt) positions described herein refer to the wild type Ad2 
sequence in GenBank (J019017). The Ad sequences are followed by the gene 
5 expression unit intended for delivery and an intron endonuclease (PI-PspI) cleavage site. 
The right end plasmid contains a PI-PspI site followed by the Ad2 genome from the end 
of the El locus rightward (nt 3527-35937). 

pLEP is a small tractable vector for cloning, whereas pREP is much larger and 
contains less frequenfly manipulated genes. Both pUEP and pREP contain a 

10 bacteriophage X cos site, oriented to generate a single cosmid of appropriate length for in 
vitro packaging following ligation of the two plasmids at the PI-PspI cleavage site. 
pLEP is tetracycline resistant (Tet^) and pREP is ampicillin (Amp^) resistant, allowing 
the recombinants to be selectively isolated by co-selection for both markers. In the 
resulting assembled cosmid, the adenoviral sequences are closely flanked by cleavage 

15 sites for the intron endonuclease I-Ceul. Digestion with I-Ceul liberates the entire 
recombinant AdV genome from the parent cosmid (see Figure 2B). 

Three classes of pREP have been constructed to allow the preparation of 
AdVs bearing El (pREPV; SEQ ID NO.: 2), El and E3 (pREP8; SEQ ID NO.: 3), or El, 
E3, and E4 (pREP12; SEQ ID NO.: 4) deletions. pREP7 (SEQ ID NO.: 2) contains nt 

20 3527-35937 of the Ad2 genome, and pREPS (SEQ ID NO.: 3) carries an additional 
deletion in the E3 region (A nt 27901-30841). pREP12 (SEQ ID NO.: 4) has deleted 
open reading frames (ORF) 1-4 of the E4 region (A nt 34121-35469, 1348 bp). AdV 
generated with these cosmids should be able to accommodate 5, 8, and 10 kb inserts, 
respectively. 

25 These aforementioned vectors were constructed as follows. The EcoRI to Bsal 

fragment that spans the ampicillin resistance gene in pBR322 was deleted and replaced 
by a sjnathetic adapter, and the bacteriophage X cos site was inserted between the unique 
Styl and BsmI sites. A PGR amplified Ad2 fragment containing the left end ITR 
(L.ITR), enhancer elements, and the encapsidation signal (nt 1-376) was created and 

30 inserted into the adapter (Figures 2A, 2B) to yield the tetracycline-resistant left-end 

plasmid pJJEP. The right end of Ad2 from the AflH site to the right end (nt 3527-35937) 
was assembled into an ampicillin resistant cosmid vector, pACKrr3 (SEQ ID NO.: 1), by 
multiple steps of PGR amplification and fragment interchange. The resultant cosmid 
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was termed pREP7 (SEQ ID NO.: 2). To expand vector capacity, two deletions were 
incorporated into the pREP7 (SEQ ID NO.: 2) cosmid, an E3 gene deletion (nt 27901- 
30841, 2840 bp); cosmid pREP8 (SEQ ID NO.: 3) and a 1.3 kb deletion (nt 34121- 
35469) in the E4 region of the Ad2 region; pREP12 (SEQ ID NO.: 4) 
5 An example of the construction of an AdV carrying a CMV-GFP expression unit 

is outlined in Figure 2. pLEPCMVGFP (Tef ) was digested with PI-PspI and ligated to 
the pREP7 (SEQ ID NO.: 2; AEl, Amp'') digested with the same enzyme. The ligation 
mixture was packaged with X phage extracts (MaxPlax lambda packaging extracts. 
Epicentre Technologies) and a fraction of the packaged phage was used to infect a 

10 recombination-deficient E, coli host, with selection for the assembled plasmid on 

Amp/Tet plates. Transductants containing pLEP fused to pREP were selected on agar 
containing 25 ju,g/ml ampicillin and 12.5 /xg/ml tetracycline (Amp/Tet). Colonies were 
selected and DNA isolated (Qiagen). DNA was used either for restriction analysis or for 
tranfection of 293 cells as described herein. 

15 Figure 3 A shows typical results for the Bgl II digestion pattern of a 

pLEP3CMVGFP/pREP7 hybrid cosmid, pAd2-7CMVGFP DNA. Because of the size 
minimum (-40 kbp) for X phage in vitro packaging and the double antibiotic selection, 
most of the colonies growing on Amp/Tet plates were the desired hybrid cosmids, and 
undesired rearrangements were rarely seen. In the present example, all four pAd2- 

20 7CMVGFP clones exhibited the digestion pattern predicted from the inferi'ed sequence. 
The entire recombinant AdV genome was then released from the cosnaid by I-Ceul 
digestion (Figure 3B). I-Ceul digestion leaves ten nucleotides to the left of the left ITR 
and eight nucleotides to the right of the right ITR. Short flanking sequences have been 
reported to be eliminated during replication of recombinant viruses after transfecting the 

25 DNA into 293 (human embryonic kidney) cells (Hanahan et al., Mol. Cell. Biol. 4:302- 
309, 1984). 

The digestion reaction can be transfected into 293 cells without purification as 
follows. 293 cells, obtained from Microbix Bisosystems (Ontario, Canada), were 
cultured in 10 cm dishes in complete Dulbecco's Modified Eagle's Medium (DMEM) 
30 supplemented with 10% FBS, 2mM glutamine and penicillin/streptomycin (Gibco BRL), 
and maintained at 37 °C and 5% CO2 atmosphere incubator. Cells were grown to -50% 
confluence on the day of transfection. Ten fig of cosmid DNA were digested with I- 
Ceul in a volume of 50 jxl. The reaction mixture was transfected into 293 cells by 
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calcium phosphate precipitation (Graham and Prevec, Manipulation of adenovirus 
vectors, p. 109-128, In E. J. Murray (ed.), Methods in Molecular Biology, vol. 7, 
Humana, Clifton, NJ, 1991) without purification. After transfection, cells were cultured 
and examined daily for the appearance of cytopathic effects (CPE). Virus propagation, 
5 purification, plaque assay, and viral DNA isolation were performed using established 
protocols (Graham and Prevec, supra). At day six post-transfection, 5-30 viral 
plaques/10 cm dish /lO /xg DNA were usually apparent, which compared favorably with 
the 30-50 plaques/10 cm dish 710 jLtg DNA found for 293 cells transfected with purified 
wild type Ad2 DNA. 

10 To compare the efficiency of recombinant virus production, similar viruses were 

also generated by homologous recombination. 20 ptg of pREP7 (SEQ ID NO.: 2) was 
co-transfected into 293 cells with 10 jUg of a plasmid encoding the left end of the 
adenoviral genome and a green fluorescent reporter gene (pLITREFlaGFP). 
pLITREFlaGFP contained the Ad2 left end nt 1-376, an EFla promoter/GFP 

15 expression unit and Ad2 sequence (from 3525-8120) that overlaps with the same 

sequence in pREP7 (SEQ ID NO.: 2). This overlap fragment served as the region for 
homologous recombination. Each co-transfection was performed in duplicate. Initial 
plaques took longer to appear (14 days post transfection) and were less abundant (0-3 
plaques per plate). 

20 Data in tlie literature suggest that exposed ITR ends favor efficient virus 

production (Hanahan et al., supra). To assess the importance of this effect, an AdV 
cosmid, pIAdEFlaGFPB, in which the AdV ITRs were flanked with a different 
restriction site at each end was constructed. pIAdEFlaGFPB DNA was digested with 
BsaBI to expose the right ITR, I-Ceul to expose the left ITR, or the two enzymes were 

25 used together to expose both ends. Digested cosmid DNA samples were transfected into 
293 cells and plaques were allowed to develop. Virus propagation, purification, plaque 
assay, and viral DNA isolation were performed using established protocols described in 
Graham and Prevec. (Manipulation of adenovirus vectors, In E. J. Mun'ay (ed.), 
Methods in Molecular Biology, vol. 7. Humana, Clifton, NJ., pp. 109-128, 1991). 

30 Ten days after transfection the viruses were harvested and viral titers were 

determined. The average titer for the viral stocks (Figures 4A, 4B) was 1.3 X 10" pfu/ml 
from transfection with undigested DNA; 2.4 x 10^ pfu/ml from BsaBI linearized DNA 
(firee rigjit ITR); 1.1 x 10^ pfii/ml firom I-Ceul linearized DNA (free left ITR); and 2.7 x 
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10^ pfu/ml for the BsaBITI-CeuI double digested DNA (both ITRs free). Thus liberation 
of each end resulted in an approximate increase in the efficiency of generating virus by a 
factor of ten (Figures 4A, 4B). 

5 Construction of an AdV Capable of Self-Rearrangement 

One approach to attenuating adenoviral gene expression and improving transgene 
persistence is the creation of viruses capable of undergoing internal, self-directed 
rearrangement upon delivery to the target tissue. In principle, this objective can be 
achieved through the regulated expression of site-specific recombinases in vectors that 

10 contain the cis-acting target of recombinase action. To allow such vectors to be created, 
the recombinase activity must be suppressed during propagation in the packaging cell 
line. As described in more detail below, the use of a lineage-specific promoter to control 
recombinase expression has been successfully employed to achieve this end. 

An example of this is shown in Figure 5. The expression of Cre recombinase 

15 was controlled by a liver-specific promoter constructed as follows. The human hepatic 
control region 1 and 2 (HCRl and 2) of the ApoE/C gene locus (Allan et al., J. Biol. 
Chem. 270:26278-81, 1995; and Dang et al., J. Biol. Chem. 270:22577-85, 1995) were 
amplified by PCR using 293 cell genomic DNA as the template. The following primers 
were used to amplify both HCRl and HCR2 fragment: HCRtop- 

20 5'gcggaattcggcttggtgacttagagaacagag 3' (SEQ ID NO.:5); HCRbot - 5' 

gcgggatccttgaacccggaccctctcacacta 3' (SEQ ID NO.:6). The ampliiBed PCR fragments 
(-0.39 kb) were cloned into pUC19. The HCRl and HCR2 sequences were confirmed 
by dideoxy DNA sequencing. The two fragments were assembled in a head to tail 
orientation, fused with a synthetic basal TATA element and cloned in a parental pLEP 

25 vector containing a GFP reporter gene. The resultant plasmid was named 

pLEPHCR12GFP. The synthetic liver-specific, as demonstrated below, provided a 
means to control Cre recombinase expression during propagation of the vector in 293 
cells, and allowed for testing the consequences of abstracting the enhancer from the 
linear vector DNA upon delivery of the DNA to the target cells. 

30 In 293 cells, this promoter is silent, allowing the viral chromosome to be 

propagated with minimal rearrangement. Any rearranged viruses that are formed lack 
packaging signals and so disappear from the pool of propagating vectors. In liver cells 
the Cre recombinase is induced by the action of the tissue-specific promoter. The 
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resulting Cre-induced recombination excises a circular episome and redirects the 
transcriptional output of the liver-specific promoter so that it directs the synthesis of the 
transgene of interest. The remaining linear fragment consists of an adenoviral genome 
lacking the enhancer and packaging signals and a Cre expression unit devoid of 
5 promoter sequences. 

In the form discussed here, one loxP site is located at nucleotide 147 of the Ad2 
genome, between the left ITR and the enhancer/packaging sequences, and the second 
loxP site is placed inside an intron a few bases upstream of the splice acceptor sequence. 
Hence the loxP site does not appear in the resulting mature transcript. The Cre coding 

10 sequence that remains on the right end linear fragment after rearrangement lies 

downstream from a splice acceptor that lacks a splice donor or upstream promoter 
sequences. This effectively terminates the expression of Cre following excision. 

Prior to recombination, the Cre recombinase gene is under the control of a 
synthetic promoter (referred to as HCR12), consisting of hepatic locus control elements 

15 from the human ApoE/C locus fused to the first intron of the human EFla gene. After 
cyclization the HCR12 promoter lies upstream of the transgene (in this case GFP) and 
the distal segment of the intron (beyond the loxP site) contains the adenoviral enhancer. 
To facilitate manipulation of the plasmids in E, colU the human IgGl hinge-CH2 intron 
(118 bp) was inserted in the Cre coding sequence at nucleotide 237, suppressing Cre 

20 expression in bacteria. The circularized episome contains the latent origin of replication 
(OriP) and trans-acting DNA replication protein (EBNA-1) of Epstein Barr virus, and 
hence is capable of autonomous replication in synchrony with the host mitotic cycle 
(Yates et al.. Nature 313:812-815, 1985). 

Using the two cosmid system described above, the pLEP plasmid containing the 

25 self-resolving components, pLEPlBHCR12, was ligated witli pREP8 (SEQ ID NO.: 3; 
AE1AE3) to create pAdVHCRGFP/EBV. The latter was digested with I-Ceul and 
transfected into 293 cells. Appearance of plaques from AdVHCRGFP/EB V was 
retarded (by 8 days) compared to non-rearranging viruses, perhaps as a result of basal 
expression of the liver-specific promoter in 293 cells. However high titer viral stocks of 

30 10^^ nominal (absorbance-determined) particles/ml was achieved. 
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Rearrangement in Target and Nontarget Cells 

To test excision efficiency, HepG2 (hepatocellular carcinoma) and Hela (cervical 
carcinoma) cells, obtained from ATCC, were infected with virus at a multiplicity of 
infection (moi) of 1,000 nominal particles/cell. This titer corresponds to approximately 
5 10 plaque forming units per cell. For these experiments, HepG2 and Hela cells were 
seeded in 35mm dishes and cultured to approximately 80% confluence in DMEMTFBS 
as described herein. Cells were infected with the desired multiplicity of virus in a 
volume of 1 mJ at 37 °C for 2 hours. At the end of the incubation, cells were washed 
with PBS twice and cultured in 2 ml of medium. Cells were collected in parallel at 

10 desired points for low molecular weight DNA and RNA extraction. Cells were 
examined for GFP expression by fluorescence microscopy (Olympus, 1X70) or 
microtiter plate reader (PerSeptive Biosystem, CytoFIuor II) before extraction of DNA 
for analysis of chromosomal rearrangement. 

DNA analysis of chromosomal rearrangement was performed as follows. 5 fig 

15 of Hirt DNA was digested with Bgl 11 and analyzed by DNA blot techniques using a 

labeled EBNA-1 gene fragment as probe (Figure 6). The Bgl n fragment from the non- 
circularized AdV is 3162 bp, generated from the 5'end of the AdV to the first Bglll site 
in the AdV. The circularized fragment created from the two loxP sites has a size of 
4915 bp (Figure 6A). Densitometry revealed that at 72 hours post infection, 95% or 

20 more of the input genomes had undergone circularization in HepG2 cells. In contrast, 

low but detectable levels of circularized fragment was visualized in Hela cells infected at 
the same time and at the same multiplicity of infection used for the HepG2 cells (Figure 
6B). 

At the time of infection (t=0, Figure 6B), the amount of input viral DNA detected 
25 by DNA blot was higher for HepG2 cells than for Hela cells when similar virus 
multiplicities were applied (moi of 1,000). This may reflect differences in AdV 
adsorption or infection efficiency between the two cell types, possibly as a result of the 
lower levels of coxsackievirus-adenovirus receptor on the Hela cells surface. To achieve 
similar viral genome input into HepG2 and Hela cells, Hela cells were infected with ten- 
30 fold more vkus (moi of -10,000) than HepG2 cells (moi of 1,000). Episomal DNA 

samples were extracted and analyzed by blotting. The results (Figure 6C) indicated that 
when comparable amounts of viral genome are present in the nucleus, the cyclization 
rate in both cell types was similar. Because the level of subsequent GFP expression is • 
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much higher in HepG2 cells than in HeLa cells (Figure 7A), it is likely that very small 
amounts of Cre recombinase suffice to promote rearrangement, and that recombinase 
expression is not limiting for rearrangement in either HepG2 or HeLa cells. 

GFP expression cannot be detected until rearrangement has taken place, so the 
5 measurement of the fraction of GFP positive cells provided a simple alternate method 
for assessing the degree of productive rearrangement. Figure 7A shoves that GFP 
expression developed quickly in transduced HepG2 cells, but that only a few GFP 
positive cells can be detected in Hela cells infected with a ten fold higher moi, 
conditions that allow circularization to a comparable extent to that seen in HepG2 cells 
10 (Figure 6C). 

The HCR12 promoter specificity was also tested by infecting two additional non- 
hepatic cell lines, A431 (human epidermoid carcinoma) and HT29 (human colon 
adenocarcinoma), with the Ad2HCRGFP/EBV vector. Both cell lines were obtained 
from ATCC and cultured using DMEM/FBS as described herein. A few cells, with 

15 weak GFP signal, were detected at 72 hours after infection in these cells (Figure 7B). In 
contrast, these non-hepatic cells could be infected efficiently with a first generation 
AdV, Ad2CMVGFP virus (data not shown), indicating that the low GFP signal was not 
due to the low infectivity of these cells' by AdV. 

To further assess the utility of the AdV genome rearrangement, primary human 

20 hepatocytes were infected with the Ad2HCRGFP/EBV vector. For these experiments, 
primary human hepatocytes, generously provided by Dr. Albert Edge (Diacrin, Inc., 
Charlestown, MA) were isolated and cultured as described by Gunsalus et al. (Nat. Med. 
3:48-53, 1997), infected with adenovirus, and GFP expression was analyzed. As shown 
in Figure 7C, GFP expression was readily detected 72 hours after infection. 

25 

Diminished Viral Gene Expression in Rearranged AdV 

After excision, the adenovirus major enhancer/packaging signal segregates with 
the episomal DNA, yielding a linear fragment containing the remainder of the AdV 
genome without this important cis-element (Figure 5). To assess the impact of enhancer 
30 deletion, PGR amplification and quantitative RT-PCR measurement of late viral gene 
expression was performed as follows. 
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Four jiig of total RNA was reverse transcribed into cDNA using M-MLV RT by a 
standard protocol (Promega). 1 [il of the cDNA from each sample was used in 
subsequent PGR reactions. PGR primers were designed to amplify the tripartite leader 
sequence of the adenovirus late genes: TPLl - 5' act etc ttc cgc ate get gt 3' (SEQ ID 
5 NO.: 7) and TPL2 - 5' ctt gcg act gtg act ggt tag 3' (SEQ ID NO.:8). For detection of the 
AdV genome in the Hirt DNA samples, 1 |Lig DNA was employed in the PGR 
amplification using the following primers which are specific for the adenovirus DNA in 
the fiber gene: Fiberl - 5* ccg cac cca eta tct tea ta 3' (SEQ ID NO.: 9) and Fiber2- 5' ggt 
gtc caa agg ttc gga ga 3' (SEQ ID NO.: 10). PGR reactions were performed as 95 °G 30 

10 seconds; 54 °C 30 seconds; 72 °C 30 seconds for 30 cycles. All amplified products were 
analyzed on a 2% agarose gel. 

For quantitative PGR, a molecular beacon based universal amplification and 
detection system was used (Intergen). A common leading sequence (Z sequence, 5* act 
gaa cct gac cgt aca 3') was added to the TPLl and Fiberl primers. The TPL2 and Fiber2 

15 primers, described above, were used in the quantitative PGR reactions. 1 \il of the 
cDNA and one (ig of Hirt DNA from each sample were used in the assay. The PGR 
were carried out in a 96-well spectrofluorometric thermal cycler (Applied Biosystems 
Prism 7700). The number of template molecules in the PGR reaction was calculated 
from the standard curve using linearized plasmid as templates. 

20 As most late adenoviral genes transcripts share a conamon -200 bp tripartite 

leader sequence (TPL) (Akusjarvi and Persson, Nature 292:420-6, 1981), the TPL 
sequence was chosen as a marker of viral gene expression. HepG2 cells were infected 
with the first generation vectors Ad2CMVGFP and Ad2HGRGFP, or the self-resolving 
vector, Ad2HCRGFP/EB V, using increasing multiplicities of infection. Total cellular 

25 RNA and low molecular weight DNA were isolated in parallel as described by Hirt (J. 
Mol. Biol, 26:365-9, 1967) and total RNA was prepared using RNAzol solution (Tel- 
Test. Inc.). RT-PGR was performed to quantitate the amount of RNA encoding the TPL 
in the cDNA samples. PGR amplification of a 201 bp fiber gene fragment from the AdV 
genome was used to detect the amount of viral genome in the DNA samples. A 

30 representative result of three experiments is shown in Figure 8A. TPL sequences were 
detected, 72 hours post-infection, with either 100 or 1000 viruses infected per cell, using 
both of the first generation adenoviruses (upper panel). 
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In contrast, no TPL signal was detected in the self-resolving Ad2HCRGFP/EBV 
infected cells, even at a moi of 100,000/cell. PGR amplification of the AdV fiber gene 
revealed comparable levels of AdV genomic DNA in cells infected at comparable moi's. 
(Figure 8 A, lower panel). The cDNA samples in which the TPL signals were detected 
5 were further analyzed by real-time fluorescence PGR. The corresponding genomic 
DNA samples were also analyzed to determine the number of AdV genomes present in 
each sample. The results are sunomarized in Figure 8B. There were approximately 
1x10"^ TPL per 1x10^ AdV genomes detected in the Ad2HGRGFP infected cells, but no 
detectable TPL in the self-resolving Ad2HCRGFP/EBV infected cells. These results 
10 indicate that adenoviral gene expression was dramatically reduced by the separation of 
the viral enhancer sequences occasioned by the re-arrangement of the self-resolving 
vector. 

TYPE A AND TYPE B VECTORS - EXPERIMENTAL RESULTS 

15 Additional experimental examples now follow that further illustrate the general 

approaches of the invention relating to using and constructing type A and type B vectors. 
For generating such adenoviral vectors, DNA sequences important for gene expression 
in the target tissue were placed between two loxP sites. The first loxP site was inserted 
between the Ad2 left-end inverted terminal repeat (ITR) and the enhancer sequence, 

20 replacing a BspLUl II and BstZ17 fragment of Ad2. A target gene expression cassette, 
comprising a promoter, a gene of interest, polyadenylation signals, the EB V repUcon, 
and site specific recombinase expression unit were inserted in place of the El locus. 

In type A adenoviral vectors, the second loxP site is placed between TetR and 
VP16, preserving the coding frame of both (Figure 1 A). A bidirectional promoter in 

25 which a central heptamer of tetracycline operator sites (TetO) (Gossen and Bujard, Proc. 
Natl. Acad. Sci. USA 89:5547-5551, 1992) was flanked by two divergently oriented 
basal elements, directs the expression of TetR loxP VP16 from a synthetic TATA 
element, whereas Cre recombinase is controlled by the same heptamer of operator 
upstream of the HIV LTR basal element. 

30 In the case of type B viruses (Figure IB), the second loxP site was inserted in the 

first intron of the Efla gene, which contains the transcription stimulating sequences 
described herein. In addition, a splice acceptor sequence was added to the 5' end of the 
coding sequence of the gene of interest. To avoid rearrangement during plasmid 
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consti'uction in bacteria, the Cre recombinase coding sequence was interrupted by the 
addition of the human IgGl hinge-CH2 intron (between amino acids Q78 and A79), as 
described herein. 

5 Designing a Compact EB V Replicon 

Most plasmids employing the EBV latent origin of replication exceed 10 kb in 
length. To provide a means for increasing the capacity of the recombinant adenoviral 
type A or type B vectors to accommodate a therapeutic gene, a compact EBV replicon 
having episomal stability was designed. To this end, deletions were generated in both 

10 the cis-acting origin of replication, OriP, and the sequences encoding the trans-acting 
replication protein, Epstein Barr virus nuclear antigen- 1 (EBNA-1) (Figure 9A). 
Episomal persistence was assessed with a green fluorescent protein (GFP)-bearing test 
plasmid by determining the fraction of cells retaining green fluorescence as a function of 
time, assuming that the half-life of GFP, in daughter cells that have not received an 

15 episome as a result of segregation failure, is approximately 1.4 days (Fukumura et al., 
Cell 94:715-725, 1998). 

EBNA-1 contains a central repeated structure that consists entirely of Gly and 
Ala residues, termed the GA repeats (Figure 9A). Although deletion of this structure has 
been reported to have little consequence, a deletion mutant consisting of both a short 

20 OriP and a short EBNA-1 (SoriP + SEBNAl) was generated and found not to support 
plasmid maintenance effectively (approximately 40% loss per cell division). A version 
of this mutant, reconstructed with 40 GA repeats, in which the short OriP was paired 
with a short EBNA-1 provided significantly better plasnaid stability (20% loss per cell 
division vs. 10 % per cell division for the wild type) (Figure 9B). Since most target 

25 tissues are relatively quiescent mitotically, this level of segregation fidelity provides 
reasonable stability in a compact replicon. 

Producing Cell Lines that Express Cre- or FLP-Dominant Negative Mutants 
As discussed herein, one obstacle to creating adenovirus carrying both 
30 recombinase and target sites has been the difficulty of controlling recombinase activity 
during virus propagation. Since efficient recombinase activity is needed in target cells, 
recombinase activity is best tempered in the production cell line. 
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Vector-independent methods to suppress recombinase activity during the 
production phase are attractive because they allow vector design objectives to 
be pursued with fewer constraints. In principle, dominant negative recombinase mutants 
provide the desired antagonism of recombinase activity. Cell lines expressing such 
5 recombinase dominant negative mutants were produced as follows. 

Dominant negative Cre mutants were selected from known point mutants 
(Wierzbicki et al., J. Mol. Biol. 195:785-794, 1987) that are defective in recombination 
function but are likely to retain dimerization function Figures lOA-E. Several mutants 
were screened for their abilities to inhibit Cre activity of a type B vector construct 

10 (ad2239 in Figure lOB) in a transient cotransfection assay. Under these conditions, Cre 
activity is detected by the expression of GFP that occurs upon rearrangement Figures 
lOD and lOE show the point mutants that were assessed and their relative activities in 
the transient cotransfection assay. Dilution studies, in which increasing amounts of 
substrate/Cre plasmid were cotransfected with the mutant forms, were conducted and 

15 based on its favorable profile, one mutant recombinase, designated CreY324C, was 
chosen for further development (Figure lOD). 

Strong constitutive expression of CreY324C, under control of the Efl a promoter 
failed to yield stable cell lines. Stable clones were obtained when the Ef la promoter 
was replaced with a tetracycline regulated promoter (Gossen et al.. Science 268:1766- 

20 1769, 1995). Clones were then tested for the ability to inhibit Cre enzymatic activities, 
and one clone, designated cell line #17, was selected for additional experiments. When 
a plasmid bearing Cre and capable of undergoing Cre-directed rearrangement to create a 
GFP transcription unit (ad2239) was transfected to #17 cells or parental 2930N cells, 
GFP expression in the #17 cells in the presence of 2\xM doxycycline was significantly 

25 lower than those of controls (Figure 11), showing that Cre enzyme activity can be 
inhibited in #17 cells. 

In addition to dominant negative Cre mutants, dominant negative FLP mutants 
may also be identified. FLP belongs to the same family of site-specific recombinases as 
Cre recombinase. A number of FLP mutations that show defects in either cleavage or 

30 ligation of FRT sites have been identified. Mutant FLP defective in cleaving FRT site 
(for example, H309L, L315P, G328R, G28E, N329D, S336Y, S336F, A339D, Y343F, 
and H345L) are generated using standard methods. Mutants that inhibit the wild type 
enzyme are then identified for generating stable cell lines according to the methods 
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described above. These and the other cell lines (described herein) are then used for 
producing FRT/FLP containing virus. 

As mentioned above, difficulties creating stable cell lines expressing Cre 
dominant negative mutants were occasionally encountered. This difficulty is not limited 
5 to Cre mutants, but also to the wild-type Cre enzyme. In contrast, 293 cell lines stably 
expressing a thermostable FLP, referred to as FLPe (Buckholz et al., Nat. Biotechnol. 
16:657-662, 1998), were created, suggesting that FLPe might not be as cytostatic as Cre 
protein. To demonstrate this, 293 cells were transfected with plasmids expressing either 
Cre or FLPe, and puromycin resistant colonies were selected. To generate stable cell 

10 lines expressing Cre or FLP mutants, 293 TetON cells were transfected with linearized 
plasmid expressing Cre or FLP mutants and puromycin acetyltransferase and selected 
with 1 [O-g/nal of puromycin. Puromycin resistant colonies were characterized further for 
their ability to inhibit Cre recombinase using the cre (ad2239) or flp (ad2879) substrate 
plasmids. Table 1 shows that there are more puromycin resistant colonies selected from 

15 FLPe transfected cells than from Cre transfected cells. From this result, it is expected 
that stable cell lines expressing a reasonably high level of dominant negative FLP may 
be readily created. 



TABLE 1 

20 



Puromycin Resistant Colonies Formed When Cre Expressing or FLPe Expressing 
Plasmid was Used to Transfect 293 Cells 



Expression Plasmid 


Number of colonies (2 M,g/ml puromycin) 


Control 


236 


Cre 


92 


FLPe 


127 



25 



Cre or Cre dominant negative mutants were also found to inhibit FLP activity 
(Table 2). Accordingly, cell lines such as cell line #17, that stably express a Cre 
dominant negative mutant (for example, CreY324C), are useful for producing FLP/FRT 
30 carrying adenovirus. 
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TABLE 2 



Cre Inhibition of PLP Activity in trans 



Plasmids 


Arbitrary GFP intensity 


Efla FLP + FLP substrate + vector control 


4.9 


Ef la KLP + FLP substrate + Efla Cre 


0.78 


Efla FLP + FLP substrate + Efla Cre R173C 


2,23 



5 

FLP enzyme activity was measured by the GFP intensity by cotransfecting with a FLP substrate 
plasmid, ad2879 (Figure 19A). GFP intensity was quantified using IP lab software. 



10 Transcriptional Regulation of Cre or FLP Recombinases 

It has been relatively difficult to achieve high-level promoter inducibility in a 
replicating adenovirus. The challenge is similar to that of achieving faithful control of 
transcription in a transient expression setting. One approach to increase the induction 
ratio in a transient setting is the use of auto-regulatory (feed-forward) circuits. One such 

15 system, based on tetracycline dependent activation, is shown in Figure 12. A central 
heptamer of tetracycline promoter operator sites (TetO sites) was placed between two 
divergently oriented basal TATA elements. The leftward TATA controls the expression 
of the TetR-VP16 fusion protein, in which a loxP (or FRT) site has been placed between 
the TetR DNA binding domain and the VP16 transcriptional activator. The rightward 

20 TATA box directs the synthesis of recombinase, either Cre or the yeast FLP enzyme. In 
the presence of tetracycline, the promoter has reduced activity in both directions. Upon 
removal of tetracycline, the synthesis of both TetR-VP16 and recombinase are induced 
(Figure 1 A). The induced recombinase then disjoins the TetR DNA binding element 
from the transcriptional activation contributed by VP16. Any existing TetR-VP16 

25 fusions thereafter promote transcription of TetR, which competes with TetR-VP16 for 
TetO, resulting in deinduction of recombinase transcription. 

When a model target cell line, HepG2, was tested with this type of adenovirus, 
the efficiency of circularization was low relative to that seen in 293 cells (data not 
shown), indicating a cell dependence of tlie bidirectional TetO promoter. To correct 

30 this, the TATA element of the TetO synthetic promoter (derived from the CMV 

inmiediate early promoter) was replaced with that of the HIV LTR. Constructs bearing 
differing components of the HIV basal promoter were analyzed for strength and 
regulation in 293 and HepG2 cells (Figure 13A). Among the constructs tested, one 
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version bearing the HIV LTR TATA and Spl elements (D in Figure 13 A) show^ed the 
least basal expression in 293 cells (data not shown) and the greatest induction in HepG2 
cells (Figure 13B). 

Using this promoter, a construct (ad3400) containing the autoregulatory 
5 structure D as shown in Figure 13 A was engineered, and Cre activities in the presence 
and in the absence of tetracycline were assayed. Plasmid ad2265 (Figure 14) in which a 
blue fluorescent protein (BFP) expression unit is interrupted by two loxP sites and 
transcription termination sequences was used as a substrate for Cre. Cre-mediated 
recombination joins BFP to the promoter resulting in BFP expression. As shown in 
10 Table 3, no difference was found in the intensity of BFP expression, either in the 

presence or absence of tetracycline. One possible explanation for this is that very little 
Cre protein is required for activity. Consistent with this idea, standard 
imunohistochemical techniques failed to reveal the presence of Cre enzyme in cells that 
were fully induced (data not shown). 

15 

TABLE 3 

Cre Recombinase Activity Regulation in Type A Constructs 

20 



Construct 


Cre Form 


+tet -tain 


-tet -tarn 


+tet +tam 


-tet +tain 


ad3400 


Cre 


2.58 


3.39 


2.71 


6.20 


ad4394 


Cre-LBD 


0.081 


0.22 


0.056 


1.02 


ad4705 


LBD-Cre-LBD 


ND 


ND 


ND 


ND 



Cre enzyme activity was measured in the presence or absence of the ligands, tamoxifen, by 
cotransfecting with the substrate plasmid (ad2265). BFP intensity (mean intensity/area) was 
quantified by analyzing fluorescent images captured by a digital camera using IP lab software. 
25 ND, refers to fluorescent intensities that were too weak to measure. Tet, Tetracycline; tarn, 

tamoxifen. 



Deletion of the PolvA Consensus Sequence from Cre or FLP Transcription Units 
30 To reduce the expression of FLP or Cre reconabinase further, the consensus 

polyA addition signals from the Cre or FLP transcript unit were deleted from vector 
constructs, leaving polyadenylation dependent on distal downstream sequences, for 
example, in gene IX. The activity of Cre using type B proviral constructs with or 
without the polyA signal was measured. As shown Table 4, the construct without polyA 
35 signals (AD229.3) showed a significant reduction of GFP intensity compared to a 



-31- 



wo 02/20814 



PCT/USOl/27682 



construct bearing the polyA signal (AD230.5). When FLPe constructs of similar 
structure were evaluated, similar results were found (data not shown). These data show 
that Cre and FLPe enzyme activity levels can be modulated by attenuating 
polyadenylation. 

5 

TABLE 4 

Effect of Deleting polvA Addition Signal From the Cre Expression Unit on Cre Enzyme 

Activity Level 

10 





polyA 


Relative GFP Intensity 


AD229.3 




0.25 


AD230.5 


+ 


1 



Post-Transcriptional Regulation of Cre Recombinase Activity 

Post-transcriptional control mechanisms of Cre recombinase activity were also 

15 evaluated. Translational fusions between Cre and the ligand binding domain (LED) of 
estrogen receptor have been reported to be regulated by estrogen (Feil et al., Proc. Nafl. 
Acad. Sci., U.S.A 93:10887-10890, 1996; Gossen et al., Proc. Natl. Acad. Sci., U.S.A. 
89:5547-5551, 1994), or, in the case of mutant estrogen receptors (Metzger et al., Proc. 
Nad. Acad. Sci. U.S.A. 92:6991-6995, 1995), by the partial antagonist tamoxifen. 

20 Use of a ligand-dependent recombinase (ad4394 in Table 3), in combination with 

the HIV LTR-based autoregulated Tet system, allowed for a small degree of regulation 
by tetracycline, but not by ligand, as assayed using the ad2265 rearrangement assay 
(Table 3). One interpretation of this finding is that fusion of the estrogen receptor LED 
to Cre provides only modest control of recombinase activity, but attenuates enzjme 

25 potency to a level so that transcriptional regulation can be measured. 

To increase control of recombinase activity, the LED was fused both to the N- 
terminus and C-terminus of Cre (LBD-Cre-LBD) and inserted into the coding sequence 
of both type A and type B vectors. When the LBD-Cre-LBD construct of type A was 
transfected into 293 cells, it showed no significant Cre enzyme activity even in the 

30 presence of ligand (Table 3). This result confirmed that the Cre recombinase activity is 
attenuated by N-terminal or C-terminal extension. 
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When the LBD fusion Cre enzymes were assayed in the type B vector context, 
only LBD-Cre-LBD fusions (pk8-ad4626) showed ligand-dependent regulation of Cre 
enzyme activities (Table 5). It appears that attenuated Cre activity in LBD-Cre-LBD, in 
the absence of ligand, is low enough to fall below the upper limit of the Cre assay. 

5 

TABLES 



Cre Enzyme Activities of Type B Provirus 



Cre Fonn 


Provirus 


-tam 


+tam 


Cre 


pk8-ad2239 


ND 


ND 


Cre-LBD 


pk8-ad4332 


4.1 


6 


LBD-Cre-LBD 


pk8-ad4626 


0.05 


4.2 



10 

Cre enzyme activity was measured in the presence or absence of the Ugands, tamoxifen. GFP 
intensity was quantified using IP lab software, 
tam, tamoxifen. 



15 

Consistent with this notion, only the construct carrying two LBDs, pk8- ad4626, 
was able to produce virus (AD121.5) by trausfection and propagate in 293 cells, while 
pk8-ad4332, which carried one LBD, produced virus (AD100.9) initially (following 
transfection of the cognate DNA) but was unable to propagate in 293 cells (Table 6). In 
20 the case of wild type Cre, no virus was produced in 293 cells by transfection. 



TABLE 6 



Production aud Propagation of Type B Adenovirus 

25 



Cre Fonn 


TypeB 
adenovirus 


Viral 

Production in 
293 cells 


Viral 

Propagation in 
293 cells 


Viral 

Propagation in 
#17 cells 


Cre 


Pack8-2239 








Cre-LBD 


AD100.9 
(Pack8-4332) 


+ 




+ 


LBD-Cre-LBD 


ADi21.5 
(Pack8-4626) 


+ 


+ 


+ 



30 
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The AD100.9 virus was able to propagate in #17 cells expressing the dorainant 
negative Cre Y324C, demonstrating that modulation of Cre activity is important for viral 
production. Thus, adenovirus carrying both two loxP sites and Cre in two different 
configurations were generated by controlling Cre activity. 

5 

Viral Rearrangement in Culture 

Cre/loxP mediated rearrangement of the adenovirus in tissue culture cells has 
also been analyzed. As shown in Figure 15A, the AD121.5 virus showed a significant 
increase in GFP expression in the presence of the ligand, estrogen, suggesting a 

10 successful rearrangement of the virus by Cre recombinases. When non-chromosomal 
DNA (Hirt, /. Mol Biol 40:141-144, 1969) was made from the cells and analyzed by 
DNA blot analysis, the viral DNA from estrogen treated cells was identified mostly in 
circular form (C in Figure 15B), while the DNA from cells not treated with estrogen was 
found mainly in linear form (L in Figure 15B). 

15 To evaluate the efficiency of the self -rearranging viruses in vivo, high titer stocks 

of AD102.7 (a type A virus carrying LBD-Cre, pk8-ad4394) in #17 cells was prepared 
and purified b}^ CsCl gradient ultracentrifugation. The titer of AD102.7 (4 -6 x 10^^/ml 
by OD) is comparable to or slightly exceeds that of control viruses (2~4xl0^^/ml by 
OD) which carry neither Cre nor a loxP site. To determine the efficiency of viral 

20 rearrangement in vivo and whether such rearrangement is dependent on the presence of 
ligand, AD102.7 virus (4 x 10^^ pfu/mouse as determined by optical density) were 
injected via tail vein into Rag-2 mice that were pretreated with vehicle alone or 1 10 
fxg/day of tamoxifen for 7 days as follows. 

Rag-2 mice were injected with either PBS (mock) or 4 X 10 adenovirus 

25 particles (as determined by OD260) of type A virus, AD102.7, via the tail vein. At 

various times after injection, animals were sacrificed and the liver tissues were removed 
and frozen rapidly on dry ice. To visualize GFP expression in animal tissues, mice were 
anaesthetized and perfused with 4% paraformaldehyde containing 0.2% glutaraldehyde 
intracardially (Kafri et al., Natl. Genet. 17:314-317, 1997), and the liver tissues were 

30 removed and fixed overnight at room temperature in the perfusion buffer containing 
30% sucrose. The fibced tissues were sectioned serially and observed under confocal 
scanning laser microscopy. In experiments evaluating the responses of ligand-regulated 
recombinase, noice were injected either with vehicle (vegetable oil) alone or with 110 
fxg/day of tamoxifen for 7 days prior to adenoviral injection. 
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Liver tissues from these animals were harvested at 2,5 hi's post injection (the 
earliest time point taken after injection) and Hirt DNA from approximately 250 mg of 
frozen hepatic tissue was prepared and analyzed by blot analysis. As shown in Figure 
16, the majority of adenoviral DNA was found in circular form in tissues from untreated 
5 nxice, as well as tamoxifen-treated mice. It can be concluded from these data that the 
Cre enzyme activity present in the tissue, even in the absence of ligand was sufficient for 
efficient self rearrangement of virus. As expected, the hepatic tissues from the Rag2 
mice injected with AD102.7 showed strong expression of GFP (Figure 17). 

Demonstratuig that AD102.7 virus, produced efficiently in 293 cells at high titres 
10 by the conventional means, can self rearrange efficiently in vivo provides the proof of 
the concept that potentially safer adenoviral gene therapy vectors can be produced. 

Adenoviruses Carrving Both FRT and FLP Recombinase 

Type A and type B proviral constructs carrying both FRT (FLP recombinase 
15 recognition site) and FLP recombinase were also generated. Structures of these viruses 
are analogous to those of loxP/Cre carrying viruses except that loxP sites are replaced by 
FRT sites and Cre coding sequence is replaced by FLP coding sequence (Figures 18A, 
18B). 

20 Virus Production at Reduced Temperature 

Temperature dependence of the Ef la promoter using GFP expression as a 
marker was also examined. As shown in Table 7, Ef la promoter activity is strongly 
reduced at 32°C in comparison to 37'^C or 39°C. The temperature sensitive nature of the 
Ef la promoter was used to propagate type B adenovirus carrying FLP at 32°C following 

25 initial production of the vkus by DNA transfection (pk8-ad3302) at 37*=^C. HepG2 cells 
infected with these viruses (AD41.4) showed strong GFP expression, but with an 
approximately 12 hr delay compared to GFP expressing viruses, suggesting that FLP 
recombinase activity may be impaired at 37°C. To iniprove the activity of FLP 
recombinase, viral constructs were created using a thermostable FLP (referred to as 

30 'TLPe") described by Buchholz et al. (Nat. Biotechnol. 16:657-662, 1998). 
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TABLE? 



Effects of Temperature on Efla Promoter Strength as Shown by GFP Intensities 



Tester plasmids 


Arbitrary GFP Intensities 




32°C 


37°C 


39°C 


Efla GFP 


16hrs 


1475 


7886 


11409 


41hrs 


6472 


36699 


50787 


86hrs 


16256 


53370 


54424 


Efla Cre +ad2204 


16hrs 


243 


1141 


2132 


41 hrs 


1094 


9119 


9784 


86 hrs 


695 


3219 


8144 



5 GFP intensities were measured using a Fluorescent reader. 



The activities of FLP and FLPe using a FLP substrate plasmid (ad2879, Figure 
19A) in 293 cells were compared. As shown in Table 8, FLPe is significantly more 
10 active than FLP under these conditions. 

TABLES 



FLPe is Significantly More Active than FLP Recombinase 

15 



Plasmid 




Mean GFP intensity 


ad4821 


Efla FLPe 


2.39 


ad2949 


Efla FLP 


0.01 



Plasmid coding either FLPe or FLP was cotransfected with a FLP substrate plasmid (Figure 19 A) 
into 293 cells. GFP intensity of each transfection was measured using IP lab program. 



20 

In addition, a tamoxifen-regulated FLPe was created by fusing the ligand-binding 
domain from a mutant form of estrogen receptor to the FLPe coding sequence at its C- 
temiinus (FLPe-LBD). The FLPe-LBD was found to be regulated by the ligand, 
tamoxifen (Table 9). Although FLP activity was retained by C- terminal fusion (FLP- 
25 LBD), addition of a short oligopeptide tag to the N-terminus of FLP abolished its 
activity (data not shown). 
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TABLE 9 



Tamoxifen Regulation of FLPe as Determined by GFP Intensities 



Plasmid 


FLPe 


-tarn 


+ tarn 


ad4821 +ad2879 


FLPe 


++++ 


+-H-+ 


ad5022 +ad2879 


FLPe-LBD(tam) 


+ 


+++ 



5 

GFP intensity resulting from FLPe mediated recombination was measured using a 
fluorescent microscope, tarn, 2 jUg/ml tamoxifen. 



10 Inhibition of FLPe Activities by Anti-sense FLPe 

An anti-sense approach to inhibit FLP enzyme activity was also employed. This 
approach tested the notion that incorporation of an open reading frame into an antisense 
transcript would stabilize the transcript and potentiate antisense activity. Two 
approaches were utilized. In one approach, the BFP coding sequence was placed 

15 upstream of anti-FLPe. In the second approach, an anti-FLPe was placed upstream of an 
internal ribosome entry sequence (HUES) and the BFP coding sequence (Figure 20). The 
ability of these constructs to inliibit FLPe was assayed using a FLP substrate plasmid, 
ad2879 (Figure 19A) and the result is shown in Figure 21. These data show that anti- 
sense FLPe is more effective in inhibiting FLPe function when it is fused to BFP, which 

20 can presumably be replaced with any other stable protein. 

OTEIER SELF-REARRANGING ADENOVIRUSES 

Mixed Infection With Adenoviruses Carrying loxP/FLP and FRT/Cre 
25 One of the ways to produce adenoviruses that can be rearranged in target cells 

but not producer cells is to engineer two separate viruses, each carrying one recombinase 
and the target sequence for the other. To test this system, type B adenoviral constructs 
carrying Cre recombinase and FRT sites and FLPe recombinase and loxP sites were 
created (Figure 22). In target cells infected with both viruses, Cre catalyzes 
30 recombination between the two loxP sites in the FLP virus, and FLP carries out FRT 
mediated recombination in the Cre virus, resulting in two circular plasmids. The loxP 
virus contained BFP, whereas the FRT virus contained GFP. Measurement of the 
fluorescent intensities of GFP and BFP, after cotransfecting the two constructs, revealed 
that BFP expression (mediated by Cre enzyme) was greater than GFP expression 
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(mediated by FLP enzyme), suggesting that the Cre enzyme functions more efficiently 
than FLP. 

Accordingly, these two recombinase activities in the target cells need to be 
balanced for complete circularization of both viral vectors. Exemplary methods for 
5 modulating Cre /FLP activity include the use of transcriptional regulation (such as by 
varying promoter strength and/or with or without poly A addition signal sequence) and 
translational and/or post-translational regulation (such as by changing FLP to FLPe and 
making LBD fusion proteins), and post viral production control (such as by changing the 
ratio of two viruses). 

10 In one approach, Cre was replaced by Cre-LBD and FLP was replaced by FLPe. 

To improve identification of the rearrangement products, BFP was replaced with RFP as 
a marker for Cre recombination. As shown in Table 10, in the presence of estrogen, 
expression of GFP (FLPe mediated) and RFP (Cre mediated) were similar. 

15 TABLE 10 



RFP and GFP Expression of Cells Cotransfected With Type B Proviral Constructs 
Carrying Cre-LBD/FRT(GFP) or PLPe/loxP (RFP) 



Plasmids 


Genotypes 


GFP intensity 


RFP intensity 


pk8-adsl20 
+pk8-adsll3 


Cre-LBD/FRT (GFP) 
+ FLPe/loxP (RFP) 


Estrogen 








+ 


3222 


3183 


46 


1954 



20 

To insure that both Cre and FLP carrying viruses with an optimal ratio infect 
each target cell, these viruses can be cross-linked prior to infection. For example, Cre 
carrying virus is labeled by biotin while FLP carrying virus is labeled by avidin. Mixing 

25 two types of modified viruses generates virus complexes of desired proportions as well. 
Biotinylation or avidinylation can be carried out using commercially available reagents 
such as EZ-Link TFP-PEO biotin (Pierce) and EZ-Link maleimide activated 
NeutrAvidin (Pierce). The extent of the biotin/virus and avidin/virus will be empirically 
determined to ensure the viability of the virus and to obtain an optimal ratio of two 

30 viruses in the complex. Optimal ratios will be those resulting in 1 : 1 Cre and FLP 
recombinase activities in target cells. The modifications will be done following 
manufacture's instructions. 
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This approach not only increases the effective capacity of adenoviral vector but 
also opens new avenue of applications involving multiple proteins, some of which 
cannot be coexpressed in production cell line as a result of combination toxicity. 

5 All references mentioned herein are hereby incorporated by reference. 

Other embodiments are within the claims. 



What is claimed is: 



10 
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Claims 

1. A replicatable viral DNA vector encoding a site-specific DNA-altering 
enzyme and a DNA target recognized by said enzyme, said enzyme selectively 
converting, in a cell expressing said enzyme, said DNA vector to a rearranged 
form. 

2. The vector of claim 1, wherein said rearranged form comprises an 
autonomously replicating episome, 

3. The vector of claim 1, wherein said rearranged form comprises linear and 
circular DNAs. 

4. The vector of claim 1, wherein said vector comprises adenoviral DNA. 

5. The vector of claim 1, wherein said vector comprises a genetically- 
engineered recombination site. 

6. The vector of claim 5, wherein said recombination site comprises a target 
of CreorFLP. 

7. The vector of claim 1, wherein said enzyme comprises a recombinase or 
an integrase. 

8. The vector of claim 7, wherein said recombinase is Cre or FLP 
recombinase. 

9. The vector of claim 1, wherein said enzyme is functional in a mammalian 
cell. 

10. The vector of claim 5, wherein said recombination site comprises a 
recognition sequence of a site-specific DNA-altering enzyme. 
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11. The vector of claim 1, wherein said vector comprises an origin of 
replication functioning in a mammalian cell. 

12. The vector of claim 1 1, wherein said origin of replication is an Epstein 
5 Barr Virus replicon. 

13. The vector of claim 1, wherein said vector comprises a gene of interest. 

14. A method for assembling a recombinant adenoviral DNA said method 
10 comprising the steps of: (a) providing a first linearized DNA vector comprising a 

restriction site and a cos site and a second linearized DNA vector comprising 
said restriction site, an adenoviral nucleic acid molecule, and a cos site; and (b) 
ligating said first and second linearized DNA vectors, said ligation assembling a 
recombinant adenoviral DNA. 

15 

15. The method of 14, wherein said first linearized DNA vector comprises a 
selectable marker. 

16. The method of claim 14, wherein said first linearized DNA vector 
20 comprises an adenoviral left end-inverted terminal repeat. 

17. The method of claim 14, wherein said first linearized DNA vector 
comprises a gene of interest. 

25 18. The method of claim 14, wherein said second linearized DNA vector 

comprises a selectable marker. 

19. The method of claim 14, wherein said second linearized DNA vector 
comprises an adenoviral right-end inverted terminal repeat. 

30 

20. The method of claim 14, said niethod further comprising packaging said 
assembled adenoviral DNA into a phage and infecting a host cell. 
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21. The method of claim 14, wherein said first and second linearized DNAs 
comprise a cosmid vector. 

22. The method of claim 14, wherein said adenoviral DNA is flanked by 
5 cleavage sites. 

23. The method of claim 22, wherein said cleavage sites comprise intron 
endonuclease cleavage sites. 

10 24. An adenovirus producer cell comprising a nucleic acid molecule that 

expresses a dominant negative site-specific DNA-altering enzyme. 

25. The producer cell of claim 24, wherein said site-specific DNA altering 
enzyme is a dominant negative recombinase. 

15 

26. The producer cell of claim 25, wherein said recombinase is a Cre or Flp 
recombinase. 



27. The producer cell of claim 26, wherein said dominant negative 
20 recombinase is CreY324C. 

28. The producer cell of claim 26, wherein said Flp recombinase is Flpe. 

29. The producer cell of claim 24, wherein said cell is a 293 human 
25 embryonic kidney cell. 



30. A vector comprising, in the 5' to 3' direction, 
a first genetically engineered cw-acting target recognized by a site-specific DNA 
altering enzyme; 
30 a gene of interest; 

a lineage-specific gene promoter; 

a second genetically engineered cis- acting target recognized by a site- 
specific DNA altering enzyme; and 

a nucleic acid molecule encoding a site-specific DNA altering enzyme. 

35 
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31. A vector comprising, in the 5' to 3' direction, 

a first genetically engineered cis-acting target recognized by a site- 
specific DNA altering enzyme; 
a gene of interest; 

5 a bi-directional promoter, comprising a second genetically engineered 

cw-acting target recognized by a site-specific DNA altering enzyme; and 

a nucleic acid molecule encoding a site-specific DNA altering enzyme. 

32. A method of gene therapy comprising the administration to a patient in 
10 need of gene therapy a therapeutically effective amount of the vector of any one 

of claims 1, 30, or 31 which is expressed in said patient 

33. A population of cells transfected with the vector of any one of claims 1, 
30, or 31. 



15 



34. A method of gene therapy comprising the administration to a patient in 
need of gene therapy a therapeutically effective amount of the population of cells 
of claim 33. 



20 
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FIG. 2A 
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FIG. 2B 
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FIG. 3B 
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FIG. 5 
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FIG. 7C 
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FIG. 9A 
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FIG. 9B 
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FIG. 10B 
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FIG. 10C 
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FIG. 11 
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FIG. 12 
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FIG. 15B 
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FIG. 21 
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SEQUENCE LISTING 
<110> The General Hospital Corporation 
<12 0> Self -rearranging DNA vectors 

<130> 00786/352WO3 

<150> US 60/231, 053 
<151> 2000-09-08 

<150> US 60/246,904 
<151> 2000-11-08 

<160> 10 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 2341 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Adenovirus 



<400> 1 

gaatcggcca 

caatatgata 

tgacgtaggt 

gggaagttac 

gccccgtatt 

cagatcaatt 

cacgcacgtt 

ccgacaggtt 

taaggcgttt 

aagaaaggaa 

tgtttttgtc 

agcaaaaggc 

cccctgacga 

tataaagata 

tgccgcttac 

gctcacgctg 

acgaaccccc 

acccggtaag 

cgaggtatgt 

gaagaacagt 

gtagctcttg 

agcagattac 

ctgacgctca 

ggatcttcac 

atgagtaaac 

tctgtctatt 

gggagggctt 

ctcctgattt 

caactttatc 



gcgcgaattc 
atgagggggt 
tttagggcgg 
gtacgcggca 
cagtgtcgct 
aatacgatac 
gtgatatgta 
acggggcggc 
ccgttcttct 
acgacaggtg 
cgtggaatga 
caggaaccgt 
gcatcacaaa 
ccaggcgttt 
cggatacctg 
taggtatctc 
cgttcagccc 
acacgactta 
aggcggtgct 
atttggtatc 
atccggcaaa 
gcgcagaaaa 
gtggaacgaa 
ctagatcctt 
ttggtctgac 
tcgttcatcc 
accatccggc 
atcagcaata 
cgcctccatc 



gattatcatc 
ggagtttgtg 
agtaacttgc 
tcgatgcgcg 
gatttgtatt 
ctgcgtcata 
gatgataatc 
gacctcgcgg 
tcgtcataac 
ctgaaagcga 
acaatggaag 
aaaaaggccg 
aatcaacgct 
ccccctggaa 
tccgcctttc 
agttcggtgt 
gaccgctgcg 
tcgccactgg 
acagagttct 
tgcgctctgc 
caaaccaccg 
aaaggatctc 
aactcacgtt 
ttaaattaaa 
agttaccaat 
atagttgcct 
cccagtgctg 
aaccagccag 
cagtctatta 



atcataatat 
acgtggcgcg 
atgtattggg 
ggatatcgcg 
gtctgaagtt 
attgattatt 
attatcactt 
gttttcgcta 
ttaatgtttt 
ggctttttgg 
ttaacggatc 
cgttgctggc 
caagtcagag 
gctccctcgt 
tcccttcggg 
aggtcgttcg 
ccttatccgg 
cagcagccac 
tgaagtggtg 
caaagccagt 
ctggtagcgg 
aagaagatcc 
aagggatttt 
aatgaagttt 
gcttaatcag 
gactccGcgt 
caatgatacc 
ccggaagtgc 
gttgttgccg 



accttatttt 
gggcgtggga 
aattgtagtt 
gcggctagcg 
gtttttacgt 
tgacgtggtt 
tacgggtcct 
tttatgaaaa 
tatttaaaat 
cctctgtcgt 
caggccgcga 
gtttttccat 
gtggcgaaac 
gcgctctcct 
aagcgtggcg 
ctccaagctg 
taactatcgt 
tggtaacagg 
gcctaactac 
taccttcgga 
tggttttttt 
tttgatcttt 
ggtcatcaga 
taaatcaatc 
tgaggcacct 
agtgtagata 
gcgtgaccca 
cgagcgcaga 
ggaagctaga 



ggattgaagc 
acggggcggg 
tttttaaaat 
acatgaggtt 
taagttgatg 
tgatggcctc 
ttccggtgat 
ttttccggtt 
accctctgaa 
ttcctttctc 
gcaaaaggcc 
aggctccgcc 
ccgacaggac 
gttccgaccc 
ctttctcata 
ggctgtgtgc 
cttgagtcca 
attagcagag 
ggctacacta 
aaaagagttg 
gtttgcaagc 
tctacggggt 
ttatcaaaaa 
taaagtatat 
atctcagcga 
actacgatac 
cgctcaccgg 
agtggtcctg 
gtaagtagtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 
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cgccagttaa tagttttcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct 1800 
cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat 1860 
CGcccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatagtt gtcagaagta 192 0 
agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca 19 80 
tgccatccgt aagatgcttt tctgtgactg gtgagtattc aaccaagaat acgggataat 2040 
accgcgccac atagcagaac tttaaaagtg ctcatcattg ggaaacgttc ttcggggcga 210 0 
aaactctcaa ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgcgcaccc 2160 
aagtgatctt ctgcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg 2220 
caaaatgccg caaaaaaggg aataagggcg acacggaaat gttgaatact catacttttc 2280 
ctttttcaat attattgaag catttatcag ggttattgtc tcatcagcgg atacatattt 2340 
g 2341 



<210> 2 
<211> 34616 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Adenovirus 



<400> 2 

gaatcggcca 

cttattttgg 

gcgtgggaac 

ttgtagtttt 

gaagttgtgg 

gttttttgtg 

ttggcccttt 

aggttttttt 

ttctggagcg 

ttttctctcc 

tgcgggtatg 

aacctgatgt 

tggtgctttt 

gtcttatgct 

ttttttttgt 

aatggtgtct 

ctacgatgtg 

ttttatatcg 

gagtatgcgt 

cgcgctggtc 

ggatcgcggt 

tgaatttttg 

atttttacaa 

cgagatgaaa 

cctgaagggt 

attgtgcaac 

accggagggg 

gaataaaaaa 

gcagaacgaa 

ggcagcggcg 

gagagagtgg 

cagatctgtt 

ttccatttgg 

gtagggatcg 

cgctgctgcc 

cctgcagtgt 

cgcgggagga 



gcgcgaatta 
attgaagcca 
ggggcgggtg 
tttaaaatgg 
gttttttggc 
gactttaacc 
ttacactgtg 
actggtaagg 
ggagggtgct 
tattaatttt 
tattcccccg 
gtttaccgag 
taatcacggt 
tataagggtt 
tattttattt 
ttttctgtgg 
cttgcttttt 
ccgcccatgc 
gtcataatca 
cgtgcagacc 
atttttgtta 
caatcatgat 
tggccggact 
attatttggg 
ttagccttta 
atcttacaaa 
agcgcgttca 
aaaaaaaaca 
tgtgtaggtt 
catgaaggag 
atatactaca 
tgtcacgccc 
catgacacta 
cctacctcct 
cgaatgtaac 
gggatttacg 
gcttgtaatc 



actataacgg 
atatgataat 
acgtaggttt 
gaagttacgt 
tttcgtttct 
gttacgtcat 
actgattgag 
ctgactgtta 
attttgccta 
gttatacctc 
ggctatttcg 
tcttacatta 
gaccagtttt 
gtttttcctg 
tgtgtttaat 
tggttccgga 
tgcgcgaggc 
aacaagctta 
gtgtgggttc 
tgcacgatta 
atgttccgct 
tcgctgcttg 
taatattcgg 
catggttgaa 
cgtccacttg 
tgccattatc 
cttaatagat 
tggttcttcc 
ggctgggtgt 
tttacataga 
actactacac 
gcacctggtt 
cgaccaacac 
tttgagacag 
actttgacaa 
ctgattcagg 
ctgaggaagt 



tcctaaggta 
gagggggtgg 
tagggcggag 
atcgtgggaa 
gggcgtaggt 
tttttagtcc 
ctggtgccgt 
tggctgccgc 
ggcaggaggg 
ctatgggggc 
gtcgcttttt 
tgactccgga 
tttacggtca 
ttgtaagaca 
gcaggaaccc 
acttacctgc 
tttgcctgat 
cataggggct 
ttttgtcatg 
tgttcagctg 
tttgaatctt 
aggctgaagg 
gatttgctta 
ggtgctggaa 
gacgtgaggg 
tgttctttgg 
cttcattttg 
agctcttccc 
ggcttattct 
acccgaagcc 
agagcgagct 
ttgcttcagg 
gatctcggtt 
agacccgcgc 
tgcacaacgt 
aatgggttgt 
gtatgcacgt 



gcgtcatcat 
agtttgtgac 
taacttgcat 
aacggaagtg 
tcgcgtgcgg 
tatatatact 
gtcgagtggt 
tgtggaagcg 
tttttcaggt 
tgtaatgttg 
agcactgacc 
catgaccgag 
cgccggcatg 
ggcttctaat 
gcagacatgt 
ctttatctgc 
tttttgagca 
acgctggtta 
gttcctggcg 
gccctgcgaa 
atacaggtct 
tggagggcgc 
gagacatatt 
tgtttataga 
cagtttgcct 
ctgtagagtt 
aggttttgga 
gctcctcccg 
gcggtggtgg 
agggggcgcc 
aagcgacgag 
aaatatgact 
gtctcggcgc 
taccatactg 
gagttacgtg 
tccctgggat 
gtgcctgtgt 



cataatatac 
gtggcgcggg 
gtattgggaa 
aagatttgag 
ttttctgggt 
cgctctgtac 
gttttttaat 
ctgtatgttg 
gtttatgtgt 
tctctacgcc 
gatgttaacc 
gaactgtcgg 
gccgtagtcc 
gtttaaatgt 
ttgagagaaa 
atgagcatga 
gcaccttgca 
gcatagctcc 
gggaagtggc 
gggacctacg 
gtgaggaacc 
tctggagcag 
gataaggtgg 
ggagattcac 
tttggaagcc 
tgaccacgcc 
taatcttttg 
tgtgtgactc 
atgttatcag 
tggatgcttt 
accggagacg 
acgtccggcg 
actccgtaca 
gaggatcatc 
cgaggtcttc 
atggttctga 
tgtgccaaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 . 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 
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ttgatatcat gacgagcatg atgatccatg 
gttccagtcc cggttccctg cagtgcatag 
ggatggtggt ggatggcgcc atgtttaatc 
attacaacat gccaaaagag gtaatgttta 
taatctacct gcgcttgtgg tatgatggcc 
ttggatacag cgccttgcac tgtgggattt 
actgtgctga tttaagtgag atcagggtgc 
tgctgcgggc ggtgcgaatc atcgctgagg 
cggagcggcg gcggcagcag tttattcgcg 
tgcacgatta tgactctacc cccatgtagg 
aaccgcaagt tggacagcag cctgtggctc 
agctgcccgg ggagtttatt aatatcactg 
ggaatataac acctaagaat atgtctgtta 
ggggagaaag gactgtgtac tctgtgtgtt 
ttctgtgagt ttgattaagg tacggtgatc 
tactgaatga aaaatgactt gaaattttct 
acatgcaaca ggttcacgat tctttattcc 
gtagcaaaag tttcagtggt gtattttcca 
taagtgctta cctcgctagt ttctgtggat 
atcataggtt tagttttatc accatgcaag 
gtttgacttt gggtttttgg ataggctaga 
tatggatttg cattagttga gttcccattt 
gagttctcca ttagaacacc gttttggtca 
gtgcctgtca tggatgaaag atctccagat 
tgactcccac attttgtaag aaccaaagta 
cagttaggag atgggtctgg ggttgtccac 
attgtaatgg cccctgagtt gtcaaagctt 
ccgttttcat tgtaatcaat gccagagcca 
gactcagatg tgtttgtatc aaactccaga 
ttatcaaagt ttagtccact ggatttfcttt 
gatgcattaa aaaggtatag gcctctgtta 
atatacaggg gtccctgccc cagtttaaga 
tccacatcta gaattaacaa gttgttattt 
atgttgtttg atgaatcata accaatagct 
tcaacggtga cacctggtcc agtaactact 
aaaggaccgc ttattttaat tcctattttt 
ttaatgccca agctacccgt ggcagtagtt 
tcgctgtcac tgccagagag gggggctgat 
gtaatgggcc ctttagtagc aatgcttagt 
gactgtacgc taagagcgcc gctagtaact 
gcgcctgagg taattgtaag tggtgcggag 
ttaagtggct gagtaacagt ggttacattt 
gtaagaccgc tgcccatttt aagcgcaagc 
acgcgtagag agagaactcc agggggactt 
ggggtaagaa agggcacagt tggaggcccg 
aaggtgtctt cagacggtct ggcgcgtttc 
gagggacaag aacatgagga atttgacatc 
aaaaggcggc tgagatacca gagttgggag 
gacaaagatt tgctgactga ttttaagtaa 
tggaataaga tctctaatac cacacatggt 
ctgatagggg aagtgcaggc agccctctgt 
tttttctccc accataagca ccagtttttg 
gttgccggta gtggtttttt cgtaggtaag 
tcttttacac tggtgtaggt taaccatgtc 
ggacgccgcc ttgcgccttt ctagtaggcg 
atctagagat tcagtcatct ccacctgtca 
gagaaggggg gcgaggctga ttgattgggg 
gcactgggaa agtagggtgg ttcatggcat 



gttacgagtc ctgggctctc cactgtcatt 2280 
ccggcgggca ggttttggcc agctggttta 23 40 
agaggtttat atggtaccgg gaggtggtga 2400 
tgtccagcgt gtttatgagg ggtcgccact 2460 
acgtgggttc tgtggtcccc gccatgagct 2 52 0 
tgaacaatat tgtggtgctg tgctgcagtt 2580 
gctgctgtgc ccggaggaca aggcgtctca 2 640 
agaccactgc catgttgtat tcctgcagga 2700 
cgctgctgca gcaccaccgc cctatcctga 27 60 
cgtggacttc cccttcgccg cccgttgagc 2820 
agcagctgga cagcgacatg aacttaagcg 2 880 
atgagcgttt ggctcgacag gaaaccgtgt 2940 
cccatgatat gatgcttttt aaggccagcc 3 000 
gggagggagg tggcaggttg aatactaggg 3 060 
aatataagct atgtggtggt ggggctatac 3120 
gcaattgaaa aataaacacg ttgaaacafca 3180 
tgggcaatgt aggagaaggt gtaagagttg 3240 
ctttcccagg accatgtaaa agacatagag 33 00 
tcactagtgc cattaagtgt aatggtaagt 33 60 
taaacttgac tgacaatgtt atttttagca 3420 
aggttaggca taaatccaac tgcatttgtg 3 48 0 
ctaaagttcc agtaatgttt tttaagtgag 3 540 
aatctaagga atatactaac acttgcaacg 3600 
acagccaaag cagctacagt agctagtact 3 660 
aatttgcagt cattatctga atgaattctg 3 72 0 
agggtaagtt tgtcatcatt tttgtttcct 37 80 
aaacccgctc caagtttagt aatcatggca 3840 
attttagttt ttattgggtt gatatctgga 3900 
ccctttcctg catttatagc tatggcagta 3960 
atgctaactt ccagtttttt agtattgttt 4020 
tagtttatgt ccaagttatg agatgcatta 4080 
cgtagttttg tttgagcatc aaatgggtaa 4140 
atacgcatgc caccgcccgt tttaatttcc 4200 
cctgcaactt tggttctaag ggagttttgt 42 60 
gttagtgtat cggagttttg tgctacttgc 432 0 
ccattattta cataaatagg atcttccatg 4380 
agcgggggtg atgcagttac agtaagggtg 4440 
gtttgcaggg ctagctttcc atctgacact 4500 
ttggagtctt gcacggtcag tggggcttgt 4560 
atcagaggag cggtggttgc cactgttagg 462 0 
gtgtccaaac ttatgtttga ctttgttttt 4680 
tgggaggtga ggtttccggc cttgtctagg 4740 
atgccgtggg aggtgtccaa aggttcggag 4800 
tcttggaaac cattgggtga aacaaatgga 4860 
gtttctgtgt catatggata cacggggttg 4920 
atctgcaaca atatgaagat agtgggtgcg 4980 
ccatttaaac tttggagaaa gtttgcagct 5040 
gaaggaaagg aggtgatgct gaataagctg 510 0 
gtaatttatt gtgtgtttat gttagttgaa 5160 
tttaataaga gtgcagaggt cctctggacc 5220 
ttctgccgag tgctgggtga cggtgatagg 52 80 
gcgctgggtg ggtagcttgt agctgaggcg 5340 
tttggcctgc ttgaccacac aaaagatacc 5400 
ttcaacttct tgttttaggc gttctcgctc 5460 
ctgttcggtg ttaattccat ccaattctag 5520 
aattaaagta gctaatctca gtgggggtgg 5580 
caataacctg ttgcagtggt atgacagcgg 5640 
ctatggcatt ccagccaatg tcaaggtatg 57 00 
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gatatatggc tagggcaaaa atggtactgc 
accaggcttc tgacaaatcg ctctgtttgt 
tgaatctgca ggaaatatgt cttttgggag 
gggcgcaaaa aatcagcaaa acaaaaatga 
aagatcagct atagtcctgt ctctgtattg 
acaaacccag tcaatgaact gaatgaaggc 
gataagggtg acaaatccgt aaagcaggta 
gatgtgagcg accgcggcca atgtagagca 
tataagaact cgaggaatca tgtctcattt 
tcagaccgtc caatctatga attttttcat 
agatgggggg tctggcgcgt ctgcgcttta 
aaacaaacafc aagcgctatg gaaaaccacc 
aggcatgaac ttggggtaaa tttagggcag 
agtccgttgt gggcgcgatg gttgagccgg 
gtttgatcaa atttgcagtg caggcgctgg 
ttgatttgaa ggttgtgggt ataatcttgc 
ttgtccaggg gaatacaagc aagcggaaaa 
gcgtctgctt fctgtatttga gataaagtaa 
cagagcggtg gaacaaaagg tgccagtgtt 
gtactgtttg ctcatgtaca tggtaatatc 
aaaagtttta cgattttcac cttggaagac 
ctgccaaata gcatatacag catacttgcc 
ttcatgttct gtggtgcatt ttataagagt 
aacattgcaa gctggttcct taaactcaac 
ggcgagcaag cctaaaatca tgtacctcat 
tgacaatagg tacaaacgtg cgtgcagcag 
ataagaataa acagaattac aagagtaagg 
gacaagcttg tagagttact tgaattgctc 
gcaaaatgct tttttgacct gagttccggg 
g*g"9'3-gtaatg tctggttacg ctcaggctgt 
cgtacgttcc cggcaggtga ggagggtggt 
gaagccgaga aggttgtgtg gcaaacttac 
aaatgaagag ccgttaaagt accaggtaag 
ggtgaggttt gctttggtct gctttgggtg 
acaggagccc cagtagattc taatttctgt 
aaagatcttg atgtaatcca gggttaggac 
cccgctcccg ctccactagc agggggcgct 
gctctacctg ggtggfcgagc cggacgccgt 
attcaaagta acaaaactca ccggagccgc 
cgaggtgfcgt caggcgcagt cgctctgcct 
ccgagtcttt caccgcgtca aagttgggaa 
cagaaaaggg gttgaagtaa accgaaggca 
tgcctccgga gcgcggctcc gaggacgagg 
taaatgaaga gcggccagcg ccgccgatct 
aggagctcac cgactcgtcg ttgagctgaa 
taccctgccc gggcgaccgc accctgtgac 
tagtcatctg aacttcggcc tgggcgtctc 
tttcctggta caccagggca gcgggccaac 
tggtaatagc cgcctgttcg aggagaattc 
gggatatcat gtggggtccc gcgctcatgt 
ctccagccgc aagtcccatt tgtggctggt 
tgctcataat ggcgctgacg acaggtgctg 
gttttcgcgc ttaaatttga gaaagggcgc 
atttgctgaa gagagcctcc gcgtcttcca 
gatacaggca gctgcgggtg agggagcgca 
cttggcccct gctttgttga aatatagcat 
cgcgggtcga tacgggttcg ttgggcgcca 
ccgctgtgga tttcttgggc tttgtcagag 



aaaaaaccat gacagagatg atggcgtata 5760 
tgtagcagct gggaatgttc catatttgag 5820 
gcgctgaggt ttgggagcaa agcacaggta 5880 
cactccgttt cataattaaa gaattctgag 5940 
cggatggtgc ctgaggtacg caatgcgcac 6 0 00 
gatgactaca gtgacgaggc tgcagatgag 606 0 
aactgtgaaa ggtgggatgc aatctacttc 612 0 
cgcacagaaa agcgcaacaa gggtcaataa 6180 
aatcatactg taaaagaaga gaacatggtt 6240 
tgtgtgggtt gagcacaatg ataggcctat 6300 
ggcaacaaat aagccacata ataataaggc 63 60 
acatgtccaa gctcgcccag tcattgacaa 642 0 
atgttagtcc ggtagcagtg gtgttgcgat 6480 
tcatctctgg agcaggcaag ctgaagctgg 6540 
cagaaatcag gcgctaacgt ccaggaaagt 6600 
ccgcctggag catatcccac atagagtaaa 6660 
tcaaggcatt ttcttttcat caataaaact 6720 
ggtacatacc aaagcaagcg ctgtaataag 6780 
ctctaaacac ttttgtgggg gccacaactt 6840 
gcacatttca taaaatggaa atttatacat 6900 
tgtgacatta tagtcgttag tgtcacctgg' 6960 
aattttgtct ttgtggcgaa taataagctt 7020 
agtgcattca ttagcttctg atttaaatgt 7080 
ctttttggca gcgctgcaga ctgccgcaag 7140 
cttggatgtt gcccccagcg tttaaaaagc 7200 
gcggcaaccc taaggcacag aagtgctagt 7260 
ataaccccga ccccaattcc agaaaaatta 7320 
atatacttaa ttaaaaaatc ccagcacccc 73 80 
agttgagctc acctcctgtt ttggaaaaat 7440 
aggtgtgggc gcagcaaccg gtgacgcact 7500 
ggtggtggtg tttttcttga cggtgtagtt 7560 
ttcgtctcgc tggaaactgt tgtaaattac 7 62 0 
gtacttattg gcccgcttgt gcaaaccgga 76 80 
ggtaaaaacg gtggcgttca caggatggcg 7740 
atttattata ctcagcacag agatgacaac 7800 
agttgcaaac cacggtcaga acacagggac 7860 
tggtaaactc ccgaatcagg ctacgtgtaa 7920 
gcgccgggcc ctcgatatgc tcttcgggca 7980 
gggcaaagca cttgtggcgg cggcagtggt 8040 
ctccactggt cattcagtcg tagccgtccg 8100 
taaactggtc cgggtagtgg ccgggaggtc 8160 
cgaactcctc aataaattgt agagttccaa 8220 
tctgcagagt taggatcgcc tgacggggcg 82 80 
gaaatgtccc gtccggacgg agaccaagag 8340 
tacctcgccc tctgattttc aggtgagtta 8400 
gaaagccgcc cgcaagctgc gcccctgagt 8460 
tgggaagtac cacagtggtg ggagcgggac 8520 
tacggggatt aaggttatta cgaggtgtgg 8580 
ggtttcggtg ggcgcggatt ccgttgaccc 8640 
agtttattcg ggttgagtag tcttgggcag 8700 
aactccacat gtagggcgtg ggaatttcct 8760 
gcgccgggtg tggccgctgg agatgacgta 882 0 
gaaactagtc cttaagagtc agcgcgcagt 8880 
gcgtgcgccg aagctgatct tcgcttttgt 8940 
gagacctgtt ttttattttc agctcttgtt 9000 
acagagtggg aaaaatccta tttctaagct 9060 
gacgcagcgc tcctcctcct gctgctgccg 9120 
tcttgctatc cggtcgcctt tgcttctgtg 9180 
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tgaccgctgc tgttgctgcc gctgccgctg 
cggtagtaat gcaggatgtt acgggggaag 
cJQ'Q'gcgaa-gQ' agatgttgcc cccacagtct 
ccgcgccacg agcggtagcc ttggcgctgt 
ttagacttac cggccctggt tccagtggtg 
agtgccggcg gcgcctgagg agcggaggtt 
tggggcgccg gcgaggggaa tgcgaccgag 
gcctcggaag cttcgtctag gctgtcccag 
acctcctctg cctgactgtc ccagtattcc 
tgcagcttct ttttgggtgc catcctggga 
cggcggcggg gggattgggt tgagctcctc 
tccctttcgt agcagaaact cttggcgggc 
gtggccctgg gtaatgacgc aggcggtaag 
gtagaaccta atctcgtggg cgtggtagtc 
cgtccacagc cccggagtga gtttcaaccc 
accctgcagc tcaaaggtac cgataatttg 
cagggagcgg tgcggggtgc ataggttgca 
gctcacgtct tccatgatgt cggagtggta 
gcagtgaccc caaagcggcg gagggcattc 
aagcgcacag caggtggcgg gcagaattcc 
ttgcaacatg ctttgactgg tgaagtctgg 
ttcggggaag ataatgtccg ccaggtgcgc 
taggtccttc aagttttgct ttagcagctt 
gcattgctgc cacacgccca tggccgtttg 
gtcgcggacg tagtcgcggc gcgcctcgcc 
gaggcggttt tcgtgcaaaa ttccaaggta 
aattttgcag gcctggcgca cgtagccctg 
cttgcgctgc atctccgggt cagcaaagaa 
aagcactgcg gccatcatta gcttgcgtcg 
aagccagcgc gccagctgct catcgccaac 
caagtttgca tccctctcca ggggtcgtgc 
gctcataacc ttggggggta ggttaagtgc 
gcgtttcagc acggctaggc gcgcgttgtc 
actttcattt tcgctgtttt cttgttgcag 
aagaccctca aagatttttg gcacttcgtc 
ctgccgcaag gccagctgct tgtccgctcg 
cttgcagttt tggaaaaaga tgtgataggt 
gtagaagttg aggcgcgggt tgggctcgca 
gcgcggtgag aacaggtggc gttcgtaggc 
atcgctgcgc tcttgcaacg cgtcgcagat 
cagcacgtcg tctcccacat ctaggtagtc 
ttcctcgttt gcctctgcgt cgtcctggtc 
atcctcgtcg tcttcgctta caaaacctgg 
aagcgggggt gcctcgacgg ggaaggtggt 
ggtggcgaac tcaaaggggg cggttaggct 
tttctgccta taggagaagg aaatggccag 
cgagcgcgga cgcggtgcgg cgcgacgtcc 
gccgtcgccg ccgcctcccc gcgcgccccc 
cgaggacgaa gaagactcgt cacaagatgc 
atcgacctcg acggcggatt tggccattgc 
caagcccgag cgcccgccat ccccagaggt 
tgtggcgcta caaatggtgg gtttcagcaa 
aggtaagcgc acggtgcggc ggctgaatga 
gcaagaggaa aaggaagagt ccagtgaagc 
gagcctgccg atcgtgtcfcg cgtgggagaa 
caagtaccac gtggataacg atctaaaggc 
agctctggcg gccgtatgca agacctggct 
cttcaccagc aacaagacct ttgtgacgat 



ccgccggtgc agtaggggct gtagagatga 9240 
gccacgccgt gatggtagag aagaaagcgg 93 00 
tgcaagcaag caactatggc gttcttgtgc 93 60 
tgttgctctt gggctaacgg cggcggctgc 942 0 
tcccatctac ggttgggtcg gcgaacaggc 9480 
gtagcgatgc tgggaacggt tgccaatttc 9540 
ggtgacggtg tttcgtctga cacctcttcg 9600 
tcttccatca tctcctcctc ctcgtccaaa 9660 
tcctcgtccg tgggtggcgg cggcggcagc 9720 
agcaagggcc cgcggctgct gatagggctg 97 80 
gccggactgg gggtccaggt aaaccccccg 9840 
tttgttgatg gcttgcaatt ggccaaggat 99 00 
ctccgcattt ggcgggcggg attggtcttc 99 60 
ctcaggtaca aatttgcgaa ggtaagccga 10020 
cggagccgcg gacttttcgt caggcgaggg 10080 
actttcgcta agcagttgcg aattgcagac 10140 
gcgacagtga cactccagta ggccgtcacc 10200 
ggcaaggtag ttggctagct gcagaaggta 10260 
acggtactta atgggcacaa agtcgctagg 10320 
tgaacgctct aggataaagt tcctaaagtt 103 80 
cagaccctgt tgcagggttt taagcaggcg 10440 
ggccacggag cgctcgttga aggccgtcca 10 50 0 
ctgcagctcc tttaggttgc gctcctccag 10560 
ccaggtgtag cacagaaata agtaaacgca 10620 
cttgagcgtg gaatgaagca cgttttgccc 10680 
ggagaccagg ttgcagagct ccacgttgga 10740 
gcgaaaggtg tagtgcaacg tttcctctag 10800 
ccgctgcatg cactcaagct ccacggtaac 10860 
ctcctccaag tcggcaggct cgcgcgtctc 10920 
tgcgggtagg ccctcctcgg tttgttcttg 10980 
acggcgcacg atcagctcgc tcatgactgt 11040 
cgggtaggca aagtgggtga cctcgatgct 1110 0 
accctcaagt tccaccagca ctccacagtg 11160 
agcgtttgcc gcgcgtttct cgtcgcgtcc 1122 0 
gagcgaggcg atatcaggta tgacagcgcc 11280 
gctgcggttg gcacggcagg ataggggtat 11340 
ggcaagcacc tctggcacgg caaatacggg 11400 
tgtgccgttt tcttggcgtt tggggggtac 11460 
aaggctgaca tccgctatgg cgaggggcac 11520 
aatggcgcac tggcgctgca gatgcttcaa 11580 
gccatgcctt tggtcccccc gcccgacttg 11640 
ttgcttttta tcctctgttg gtactgagcg 117 0 0 
gtcctgctcg ataatcactt cctcctcctc 11760 
aggcgcgttg gcggcatcgg tggaggcggt 11820 
gtcctccttc tcgactgact ccatgatctt 11880 
tcgggaagag gagcagcgcg aaaccacccc 11940 
accaaccatg gaggacgtgt cgtccccgtc 12000 
aaaaaagcgg ctgaggcggc gtctcgagtc 12 060 
gctggtgccg cgcacaccca gcccgcggcc 12120 
gtccaaaaag aaaaagaagc gcccctctcc 12180 
gatcgtggac agcgaggaag aaagagaaga 12240 
cccaccggtg ctaatcaagc acggcaaggg 123 00 
agacgaccca gtggcgcggg gtatgcggac 12360 
ggaaagtgaa agcacggtga taaacccgct 12420 
gggcatggag gctgcgcgcg cgttgatgga 12480 
aaacttcaag ctactgcctg accaagtgga 12540 
aaacgaggag caccgcgggt tgcagctgac 12 600 
gatggggcga ttcctgcagg cgtacctgca 12660 
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gtcgtttgca gaggtaacct acaagcacca 
ccgctgcgct gagatcgaag gcgagcttaa 
ggagcacgtg attgaaatgg atgtgacgag 
gtctagcaag gccaagatcg tgaagaaccg 
caccgacgca aggtgctgcg tgcatgacgc 
gtcttgcggc atgttcttct ctgaaggcgc 
ggctttcatg caggcgctgt atcctaacgc 
actacggtgc gagtgcaact caaagcctgg 
aaagttgact ccgttcgccc tgagcaacgc 
caagagcgtg ctggccagcg tgcaccaccc 
tgtgtatcgc aactcgcgcg cgcagggcgg 
gcccgacctg ctaaacgcgt tggtgatggt 
gctgccgcgg atggttgtgc ctgagtttaa 
gtccctgcca gtggcgcata gcgatgcgcg 
gacggcaagg gtggggggta aataatcacc 
tattgaaagt gtctcctagt acattatttt 
gcgctcctaa tctgcgcact gtggctgcgg 
tagagctgtt cctggttgcg acgcagggtg 
ttgggtaccc cggtaataag gttcatggtg 
ttggcaaagg cgtggagaaa catgcagcag 
tgcacgcttt gggtggactt ttccagcgtt 
ctacggcgca ggagtgactc gtactcaaac 
aagccaaagg gctcaaagag gtagcatgtt 
cagtgtacgc ccccagtctc gcgaccggcc 
ggagaaacaa agcctggaaa gcgcttgtca 
agatctttga caatggcttt cagttcctgc 
gatgttgctt gcttctttta tgttgtggcg 
acggtctcga tgacgccgcg gtgcggctgg 
aaaacataaa gaagggtggg ctcgtccatg 
tgggcggagt tggcgtagag aaggttttgg 
aagttactgg agaatgggat gcgccaaagg 
atactgtcaa ccgcggtttt gcctattagt 
ccctcgcgca tggtgggagc gaggtagcct 
attccaacct gctgatactc cttgtattta 
tggaagtttc tgaagaacga gtacatgcgg 
tggtagccaa tattgtagtt ggccaacatc 
cactgagcta cgttgtagcc ctccccgtca 
gtaagcaggc ggtcgttgcc cggccagcta 
ttaaaggtgt gattaagata gaaggttccg 
tagtaagggt cgtagcctga tcccagggaa 
gcccaaccgc gaaatgctgc ccagttgcgc 
ttggcgggta tggggtatag catgttggcg 
ttggtgtcat ttctgagcat ggcttccagc 
aaggtggcgt aaagacaaat gctgtcaaac 
tttcccagag agctctgcag aaccatgtta 
tafcgagcctg gcaggaggag gaggttttta 
tgaaagggca cgtagcggcc gtttcccaac 
cggtggtggfc taaagggatt aacgttgtcc 
ttaatgtagc agtctacaag cccgggagcc 
ttggggttgt cagatatttc cacattggtg 
agcgcaatat tggagtaaag gaaatttctc 
gcaaagttgt tacccactcc tatttcatta 
gtagtatctc cattatcgcc tgagccattg 
gttaccccaa tacccccaag aggaaaacaa 
ttttcaatga ttctaacatc tggatcatag 
tatctggttc tatcacctat ggaatcaagc 
tcttgcaaat ctaccacggc atttagctgc 
ccagtgctgt tataatacat taggccaata 



cgagcccacg ggctgcgcgt tgtggctgca 12720 
gtgtctacac gggagcatta tgataaataa 127 80 
cgaaaacggg cagcgcgcgc tgaaggagca 12840 
gtggggccga aatgtggtgc agatctccaa 129 0 0 
ggcctgtccg gccaatcagt tttccggcaa 12960 
aaaggctcag gtggctttta agcagatcaa 13 02 0 
ccagaccggg cacggtcacc ttctgatgcc 13 080 
gcatgcaccc tttttgggaa ggcagctacc 13140 
ggaggacctg gacgcggatc tgatctccga 1320 0 
ggcgctgata gtgttccagt gctgcaaccc 13260 
aggccccaac tgcgacttca agatatcggc 1332 0 
gcgcagcctg tggagtgaaa acttcaccga 133 80 
gtggagcact aaacaccagt atcgcaacgt 13440 
gcagaacccc tttgattttt aaacggcgca 13 50 0 
cgagagtgta caaataaaaa catttgcctt 13 560 
tacatgtttt tcaagtgaca aaaagaagtg 13 62 0 
aagtagggcg agtggcgctc caggaagctg 13 680 
ggctgtacct ggggactgtt aagcatggag 13740 
gggttgtgat ccatgggagt ttggggccag 13 800 
aatagtccac aggcggccga gttgggcccc 13 860 
atacagcggt cgggggaaga agcaatggcg 13920 
tggtaaacct gcttgagtcg ttggtcagaa 13980 
tttgagcgcg ggttccaggc aaaggccatc 14040 
gtattgacta tggcgcaggc gagcttgtgt 1410 0 
taggtgccca aaaaatatgg cccacaacca 14160 
tcactggagc ccatggcggc agctgttgtt 1422 0 
ttgccggccg agaagggcgt gcgcaggtac 142 80 
tgcacacgga ccacgtcaaa gacttcaaac 14340 
ggatccacct caaaagtcat gtctagcgcg 14400 
cccaggtctg tgagtgcgcc catggacata 14460 
gtgcgatcgc aaagaaactt tttctgggta 14520 
gggtagggca cgttggcggg gtaagcctgt 14580 
acgaatcctg agttgttatg ctggtgaaga 14640 
gtatcgtcaa ccacttgccg gctcatgggc 1470 0 
tccttgtagc tttctggaat gtagaagccc 14760 
tgcaccagga accagtcctt ggtcatgttg 14820 
actgagcgtt taatctcaaa ctcattggga 14880 
acagaagagt caaaggtaat ggccacc ttc 14940 
tcaaggtatg gtatggagcc agagtaggtg 15000 
ggggtttcct ttgtcttcaa gcgtgtgaag 15060 
gatgggatgg agatgggcac gttggtggcg 1512 0 
gcggaaaggt agtcattaaa ggactggtcg 15180 
gtggaggccg tgttgtgggc catggggaag 15240 
ttaatgctag ccccgtcaac tctaagatcg 15300 
acatccttcc tgaagttcca ttcatatgta 153 6 0 
atggcaaaaa acttttgggg cacctgaatg 15420 
aacatggagc gataacggag gcccgcattg 15480 
atgtagtcca gagaccagcg cgccccaagg 15540 
accactcgct tgttcatgta gtcgtaggtg 15600 
gggttgtatt ttagcttgtc tggcaggtac 15 660 
cataggttgg catttaggtt aatttccatg 15720 
cgtgttgcaa aagtttcatc ttttgtccat 15780 
ccattagcct taatagcttg ataggtgtca 15840 
taatttggca attcatcctc agttccatgg 15900 
ctgtctacag cctgattcca catagaaaaa 15960 
aagagttgat aggacagctc tgtgtttctg 16020 
gatgcctgac cagcaagaac acccatgfcfcg 16080 
aaattgtccc tgaaagcaat gtaattgggt 16140 
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ctgtttggca 
ttgtaagaca 
tttggtttag 
aagtcaacct 
aaaggatttg 
agcactctcc 
ggttctggtt 
gatcctattt 
acatgtgttt 
tcttcatctt 
cactcacagg 
ggcttaaaag 
tcaagcacac 
tcctcgcggt 
tcgtgcgtag 
gtggcgcggg 
atgtgcatgt 
acggcggctc 
ttagctattt 
ttcagtgtgc 
gcacggcgca 
acaggtttct 
cccagcactc 
gcctcctcgt 
cgcggacgct 
tctacgggag 
aggtccacca 
tgctctttca 
tagttcttaa 
gcgccaccgc 
Gcgtgtcgcg 
accaagcgag 
taatcagtgt 
cgcggcgatc 
gcgccgccgc 
cctcctacgg 
ggccatatct 
tgttggcagc 
cagcacgggc 
agfccgctcgt 
aggatgtagg 
cgggcgccgg 
tttggacgcg 
ggacgtttgt 
tatctgaacg 
gcggacctcg 
ctggcgcttg 
gccgcgcttt 
taggctaggt 
tccgtaggca 
cacctcatca 
ggtggtgccg 
gggcgcggtt 
ctttttcttt 

ggggccatag 

catagcttcg 

ttttcttgca 
cgagtcgcgc 



tagattgttg 
gatgtgtgtc 
tagcattgcc 
ttggaagagg 
taggcctggc 
ctcctgccgc 
gataggaagg 
gtagcccgct 
tcttagtagc 
cctcttcttc 
agttaggagc 
taggccccct 
ggttgtcacc 
ccacagggat 
gtgccaccgt 
caaactgcac 
aagaccactg 
agcagctcct 
agaagcatcg 
tttgccagtt 
gggacgcgcg 
gctgggtgtc 
cggtagccat 
acgagggagg 
tttcgccacg 
ggcggggatc 
ccccgctaat 
acttgtccct 
tggtggaacc 
tcatattgct 
gggccagaga 
cgtgagactc 
ctctgcgcct 
agtggaataa 
cggtggtgcg 
tgcattcttc 
gcaagaacca 
accagggtcc 
tggcgacggc 
cttctggggc 
gcatattcgg 
ggggctgaaa 

gccgcagcgg 
gtctccatgc 
tccacggtct 
ggctccagcc 
ggtaccatca 
tcttcggacg 
gttgggttgc 
aactccccga 
tacacgcgct 
ggtcgcaaaa 
gcgtgcagca 
ttgacccgct 
atctccggcg 
ttgcgcgccg 
atctagttgc 
acccagtaca 



acccaacata 
tggggtttcc 
ttgccggtcg 
cacccctttt 
ataagatcca 
attagcatca 
atctgcgtat 
ttttgtaatt 
ctgatctcga 
atcctcggca 
gcccttggga 
gtccagca.cg 
cacagccagg 
gaaccgcagc 
g9"ggtttcta 
cagcccgggg 
cggcatcatc 
ctggcggcga 
tcggcgcttc 
gccactggct 
gctagggcgg 
agcgggggga 
gggcgcgatg 
ctcatctatt 
cGcctctgga 
aagcttactg 
gccagaggcc 
cagcatctgg 
gaaattttta 
ggtgccgata 
cgcaaagttg 
cagacttttt 
gcaaggccac 
ggaggggcag 
cacgacgcat 
ctcggaatcc 
caaagaccgg 
tgcctccttc 
gacggcggcg 
ggtaggtgta 
gcagtagtgc 
cgcgaaacat 
ccgcctgcac 
cctctgtggc 
gcacgcccag 
caggctccac 
gctgcacggt 
gtgcaagcgt 
cctcgtccag 
ggcgctcgtt 
tgtaggtgcg 
cacgtcttac 
gttccacctc 
ttagctttcg 
cgatgacctg 
ccgccgctgg 
gcggggggcg 
cgttgcccct 



gctttagaat 
afcatttacat 
ttcaaagagg 
tcatccggaa 
tagcatggtt 
gcttcgttcc 
acaggtttag 
gtttctccag 
gcgttttgct 
actgcccggc 
gctagagcgt 
ccgcggatgt 
gtgaaccgcg 
gtcaaacgct 
aacttgttat 
ctcaggtact 
gaaggggtag 
catggacgca 
agggattgca 
acgggccgca 
gttacaacaa 
ggcaggtcca 
ggacgggtgg 
tgcgtcacca 
gacactgtct 
ttaatcttat 
aggccatcta 
cctgtgctgc 
atgccgctcc 
tcttgccagt 
atgtcttcca 
attttgattt 
ggatgcaatt 
gataccgccg 
gccgcccgtc 
cggcaccggg 
cttttaaacg 
gcgagccacc 
gcgggttcca 
gccacgatag 
gctggcggtg 
ccacgggtcc 
cgcggcatct 
agtggcaata 
tcccggtgcc 
ggtcattttt 
gggtgccaag 
gggcagcacc 
cggcaacgcc 
ggcctgctca 
ggtggagcgc 
gcgtcgacct 
gtcgtcaagt 
gggcttgtaa 
gagcatctct 
atacatacaa 
ggtgcgcacg 
gcgaccctga 



tttcatcacc 
cttcactgta 
tagtatttga 
ccagaacgga 
tcatgggagt 
actgagattc 
cttgtgtttc 
acaaaggagc 
cttcttcttc 
cgctatcttc 
tgtaggcagt 
caaagtacgt 
ctttgtacga 
gggaccggtc 
tcaggctgaa 
ccgaggcgtc 
ccatcttgga 
tacatgacac 
cccccagacc 
tcgatcgcgg 
cggcggacgg 
gcgttacagg 
tgggcaggcc 
gagtttcttc 
ccacggccgg 
tttgcactgc 
ccaccttttg 
tgttccaggc 
acagcgagcc 
ttcccatgaa 
ttctacaaaa 
ttccacatgc 
ccgggcacgg 
cgcatgcgac 
aggccgtggc 
aaacggaggc 
atgctggggt 
ctgcgcacgg 
gtggtggttc 
ccgggggtag 
ccgtacttcc 
gtttgcacct 
gccaccgccg 
ctagtgctac 
acctgcttga 
tccaagacat 
tcaccagact 
tgctgcagtg 
aacatgtcct 
agcaggtcct 
tcaccgggcg 
ttccactgta 
tcatcatcat 
tcctgctctt 
tctttgattt 
cagtacgagfc 
ggcacgcgca 
gtcatagcac 



ttttccaggt 
caaaaccact 

gaagaattgc 
ttgaccacca 
tgttttttta 
gccaatttga 
tgcattgtct 
ctgggcatag 
ctcttcttca 
ggtttgttcc 
gccggagtag 
ggaagccata 
gtacgcggta 
tgtggttacg 
gtacgtctcg 
ctggcccgag 
aagcgggcgc 
atacgacacg 
cacgatgctg 
accgctggcg 
ccctggcagc 
tgtgtgctgg 
ttgctttagt 
cctgtcgggc 
tggaggctcc 
ctggttggcc 
ttggaaattt 
cttgctgcca 
ccagctgaag 
cgggcgcgag 
tagttacagg 
aacttgtttt 
cgccaatcgc 
ggtgcgacgc 
cggccatgcc 
ggcaggtgag 
ggtagcgcgc 
aaatcggggc 
ggcgtcgggt 
gcgcgatgga 
tggaacggcg 
ccgtagaggt 
aggcaaccgg 
tggtggtggg 
ttggccgcac 
cttccagtcg 
cgcgctttag 
tcacgggctt 
tatgccgctt 
cgtcgccgta 
taaaaactac 
cccgccgcct 
catcatcttt 
ccttcttcgg 
tgcgcttgga 
ctaagtagtt 
ggccgctaac 
taatggccgc 



16200 
16260 

16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
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ggctgctgcg 
cagccttcga 
ggccgccgcc 
cctccgccgt 
cgcgtccact 
cgcgtcaatg 
gggcgcgcgg 
gccaaacatc 
gatggacatg 
ggccttgtaa 
tcctccgatg 
actgacggtg 
attgaacacg 
gttgtagaag 
gatctggctc 
gacctgctcc 
ggtctgaggg 
gtcattgctt 
cggtttcttc 
tgcctcggcc 
atgatcgttc 
cgcgccgctg 
ttttaagctt 
gtcatcgtag 
gttgctcagg 
atggaaggct 
cagtctgaag 
cctgcccact 
ctcggagtag 
tttgtcctta 
gaactcgttc 
cgaacggtcg 
ttgaatgacc 
cgttgacttg 
gtaacggatg 
gtccagggga 
gctctcgtag 
aatacaagaa 
cttttttttt 
tgccactccc 
ccagaccGcg 
ccctgtgctc 
cgttgttggg 
ttaggfcactc 
ctatcaagct 
acaagctgct 
gttgcaactc 
agaggaatcc 
gtagccagtg 
gatgcaagta 
ccatgttgcg 
taaggctcac 
gcaacgaggg 
tgcccgtgtc 
cgtccacatg 
tgtcccggga 
agttgggcgt 
acagcccctg 



gcggccgctc 
gcggcccgca 
gcgcgttggg 
ctcttcattt 
gtggacactg 
gcgtcatcga 
tagtgcccgc 
ttgcttggga 
tttgctcaaa 
acgtaggggc 
ctgttgcgca 
gtgatggtgg 
tgggtcagag 
ctcttggagt 
gtggagcgga 
gagccgcagg 
tcgccgtagt 
attaggttgt 
tgaggcttct 
tcagcgcgct 
atgtcctcca 
ccactgttgt 
gcctggtagg 
gtgatcctaa 
cggctgtggg 
tcgtttgtat 
ttgcgggtgt 
ttcaagtagt 
ttgccctcgg 
gtaagcgagc 
acatttggca 
tcaagattga 
gtggttagaa 
ttgtccacaa 
ctgtttctcc 
gcatcgaagg 
gagggaggag 
aaccaacgct 
ttttttttaa 
tcccaaatcc 
ctgacggtcg 
ctgcgcatac 
aaatgccgga 
ctcctcgccc 
tggaaatggg 
tggcctgcgg 
tagcagggtc 
atcgttaccc 
cgggttcaag 
gtccattagg 
cgcggtcatg 
gctctgctgc 
gatcttctgc 
ctcctgcccc 
cgcctgacct 
cacgctgcca 
cagcaagcta 
caagttcttg 



gtcgcctgga 
tggccgcccg 
cggcagtgcc 
tagcataacg 
gtggcggcgt 
cggtggtgcg 
gcacgcgcac 
agcgcaggcc 
aagtgcggct 
aggtgcggcg 
gcggtagcgt 
gggctggcgg 
aggtaaactg 
gcacgggcaa 
aggtcacggg 
ttacgtcagg 
tgtatgcaag 
aactgcgttt 
cgacctcggg 
fcctcctccgc 
ccggctgcat 
tgccgccgcc 
cgtccacatc 
agccctcctg 
tgaagtccac 
ataccccagg 
caaactttac 
gctccacgat 
gcagcgtgaa 
gcgacaccat 
tgttggtatg 
tggtctgtgt 
agttgctgtg 
ggtacacacg 
ccccggtagg 
gggaacccag 
gaccttcctc 
cggtgccatg 
aacattctcc 
aggacgctgc 
tgcctttgac 
gtcttccatc 
ggcaggttct 
agcaggcgcg 
ctactcgcat 
aagctttcct 
tgcggttgcg 
tcgggcacct 
atggcattgg 
cgattgataa 
tccagcgcca 
acatagcgca 
cgccggttgg 
agcgcgcggc 
atggcctcgc 
ctgtccgtga 
gacacggtcg 
aaagcctggc 



cctggggggc 
tcggccggtg 
gggtcggcgg 
ccgggctccg 
gggcgtgtag 
cccagtgcgg 
tgggtgttgg 
ccagcctgtg 
cgataggacg 
tctggcgtca 
cccgtgatct 
gcgcgccaaa 
gcggatgagc 
cagctcggcg 
gtcttgcatc 
agtgcaaagg 
gtaccagctg 
cttgctgtcc 
ttgcgcagcg 
ccgtgtggca 
tgccgcggct 
tgcgccatcc 
caacagtgcg 
gaagggttgc 
cccgcatcct 
catgacaaga 
cccgatgtcg 
cgcgttgttc 
ctccacccac 
cacccgcgcc 
caggatggtt 
gcttgcctcc 
gtcgttctgg 
ggtggtgtcg 
ccgcaggtac 
cgccgccgcc 
atacatcgcc 
gccttggtga 
ccagcctggg 
tgtcgtctgc 
gacgggtggg 
tactcatctt 
tttcgcgctg 
ggcgggtggt 
ctgaccgcgg 
ttcgcagcgc 
gggaaaacac 
caaatccccc 
tgaaatactc 
acggccggtt 
cgctgggcgt 
agatgcgctc 
tcagcaggta 
tgacacttgt 
ggtacagtgt 
agggcgctat 
cgcggtcgcc 
tcaggtttac 



acagtgacaa 
cgacgtgcgc 
cggtggcgac 
cgcaccacgg 
ttgcgcgcct 
ccgcgtttgt 
tcggagcgct 
ttattgctgg 
cgcggcgaga 
gtaatggtca 
gtgagagcag 
atctggttct 
tgggagtaga 
cccaccaccg 
atgtctggca 
agggtccatg 
cggtactggg 
tctgtcaggg 
ggggcggcag 
aaggtgtcgc 
gccgcgttgg 
ccgccctgtt 
ggaatgttac 
cgcttgcgga 
ggcagcaaaa 
ccagtgactg 
ctttccagaa 
ataaggtcta 
tcatatttca 
ttaaacttat 
ttcaggtcgc 
cccgggctgt 
tagttcaggg 
aataggggtg 
cgcggaggca 
actggcgccg 
gcgcgctgca 
gttttttatt 
gcgaaggtgc 
cgagtcatcg 
cgggcgcggg 
gtccactagg 
cggctgcagc 
gcgagtgctg 
ggccgcagcg 
cgcctctgcc 
gctgtcgtct 
ggtgtagaaa 
ggggttcacg 
tgaggcatac 
taccccgtcg 
ctcctcgctg 
gttcagggtt 
aatctcctgg 
cagcaagtga 
tagcagcagc 
tgtgggagcc 
ggtctgcagg 



tacccgcggc 
ggttaagcag 
gtgctacgcg 
tctgaatggc 
cctccaccac 
gcgcgcccca 
tctttgcccc 
gcgatataag 
ctatgcccag 
ctcgctggac 
gaacgttttc 
cgggaaagcg 
cggcctggtc 
gaaagttgct 
acgaccagta 
agcggatccc 
tgaaggtgct 
gtttgatcac 
cttctgccgc 
cgcgaatggc 
agttctcttc 
cggtgtcatc 
caccctccag 
tgcccaacaa 
tgatgtctgg 
ggtcaaaccc 
ccccgttctg 
tggtcatggt 
gctccacctg 
tggtaaacat 
cgccccagtg 
agtcattgtt 
atgccacatc 
ccaactcaga 
caaacggcgg 
cgctcaccac 
tactaagggg 
fctgcatcatg 
gcaaacgggt 
tcctcccaca 
ccgggcacat 
ctctctatcc 
agcgagttgt 
gtaaaagacc 
cctagatcgg 
tgctcgcgct 
atgtcgtccc 
ccagggggcg 
gcggccgcgc 
atgcccggtt 
cgcatcaggt 
tttaaactgt 
gcctccaggc 
aaagtatgct 
cctaggtatg 
aacaggcgcg 
cgcacccccc 
ccttgtctac 



19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
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tggtctggaa 
ccattagccg 
cgggctgtgt 
gcgcgcgctc 
tctgcacgtt 
cggcctcatc 
tctcgtgcgt 
tggcctgcgg 
ccatgacctg 
cccgcaccgc 
ctcgccgfccc 
gccagcgcgc 
tcggcttggg 
gccgctgccc 
ctcgcgccgc 
gtcgtcgctc 
ctccttgtct 
actgagcatg 
caccactatg 
tatgttfcagc 
cagctgcgcc 
cagatgcatc 
aaagcttttt 
ggcggccgcc 
ctcctcgcgc 
ctttcgatcc 
cggcacgtac 
gggcgctcct 
gcgccgcggg 
tgacgcggta 
ctgctgagga 
gggctcgtcc 
cccccgttcg 
cttggaaaat 
taaccgctta 
tggaaggaag 
cgcgggcggt 
gtccgccatg 
gctctccttt 
cccgaccatg 
gcgcgcctcc 
tcgccttatg 
ccgccagccg 
ctacgtcttt 
tcaccagacc 
ataccaccgc 
catattagcc 
ggtcacacgc 
gatagatgca 
catgcaagac 
ccgcctgcgc 
tctcaagacc 
ccgccacccg 
cgcctttctc 
cggagtacct 
cagccccccg 
cgagaacggc 
ctttgtcgac 



aaaatagtct 
cagtgcgctc 
actcaggaag 
acgctgcgcc 
gccgctgttg 
gggccggatg 

ggggtttgcg 

ctgctgccgg 
gcgccagtcg 
cgggtccgtt 
tctggctcgt 
gcgggtgcca 
gcccagcgca 
gtgccagcca 
cggctcacgc 
aaggtaagca 
atgggaacgt 
gaatagttaa 
ctctgcagaa 
agcgcatccc 
atgagcggct 
agtcctatag 
tgaaagttaa 
acgtgtgcgc 
agcaaccgct 
cgcatctcct 
gcctcgcgcg 
agccgcgcca 
ggttcgtaat 
ggagaagggg 

ggggggcgca 

ctgtttccgg 
ccgcagtccg 
aaccctccgg 
cgccgcgcgc 
ccaaaaggag 
aaccgcatgg 
atacccttgc 
tgcacggtct 
gagcactttt 
accaccgccg 
ttggaagacc 
ccgccgcact 
gactcaaggg 
gttaactggt 
tttgtggaca 
gagcgcgttg 
atgggaggaa 
agagatgcag 
tactacaaag 
attcagcagg 
gcctacttta 
ctgccgcccg 
gagaggtttt 
acacaacaat 
ccaacccata 
cgcgccgtca 
cgcctcccgg 



ggcccggact 
acaaagttgg 
gcgtttagtg 
acggcctcgc 
taacgagcca 
gccctgtttt 
cgcgccggga 
aacgcgtcag 
tccgtggagt 
gcgtcttgca 
actcatcgtc 
ccgccagccc 
ggtcagcgcc 
gggccctttg 
tcacggcctt 
ccttcaacgc 
aaggggtatg 

tggcggccac 

tgtttatcaa 
tgaatgcctc 

tgctatttgg 
ccacctcctc 
tctcctggtt 
gcgcgggact 
cgcggttcag 
cgggctcctc 
tgtcacgctt 
ggccctcgcc 
caccatctgc 
agggtgccct 
tctgccgcag 
aggaatttgc 
gccggcccga 
ctacagggag 
ggccagtggc 
cgctcccccg 
atcacggcgg 
gaatttatcc 
agagcgtcaa 
tgccgctgcg 
ccggcatcac 
tcgcccccgg 
ttttggtggg 
cttactcgcg 
ccgttatggc 
tggatgactt 
tcgccgacct 
gagggcgcca 
gacaagagga 
acctgcgccg 
ccggacccaa 
attacatcat 
ccacggtgct 
ccgatccggt 
tgttgagatg 
accgggacat 
ccgagaccat 
tgcgccgtcg 



ggtacacctc 
tgtagtcctc 
caaccatgga 
gcacatcccc 
cgcgctgaag 
cggccagcgc 
ccaccgcttc 
ggttacgcgc 
taaggccgga 
tcatctgatc 
ctcgfccatat 
aggtccggcc 
cgcgtcaaag 
caggctgtgc 
gtggatgcgc 
catgcgcatg 
gtatatcttg 
cttgtcagcc 
atcgagcagc 
gttgtccctg 
gttttgctcc 
gcgcgccaca 
caccgtctgc 
aatcccggtc 
gccatgccgc 
tcccfccgcgg 
cagctgcacc 
ctcctccaag 
cgccgcgtca 
gcatgtctgc 
caccggatgc 
aagcggggtc 
gactcgaacc 
cgagccactt 
caaaaaagct 
ttgtctgacg 
acggccggat 
accagaccac 
cgactgcgca 
caacatctgg 
ctggatgtcc 
agccccggcc 
atatcagtac 
tctcaggtac 
caactgcact 
ccagtctacc 
ggccctgctt 
cctacggcca 
aggagaagaa 
atgtcaaaac 
ggacatggtg 
cagcagcacc 
cagcctacct 
cgatgcggac 
catcgttagc 
gacgggcggc 
gcgccgtcgc 
tcgccgccgt 



actttgcggt 
ctgtccccgc 
gcccaggttg 
caccagccgg 
cagcgcgtcg 
gtttacgatc 
cagaattgcg 
agtcagcgac 
cggctggctc 
agaaacatca 
tcctccacgc 
ccagctgcct 
taggactcgg 
atcagctcgc 
tcgttgcgat 
tagaacccct 
cgggcgtaaa 
aggctcaagc 
cagcggccct 
ctgtgctgca 
agcgcgctta 
agcgtgcgca 
tcgtacgcgg 
cgcgcgtcgg 
aactcgcgcc 
tcgcgaaaca 
cttgggtgtc 
tccaggtagt 
gccgcggatg 
cgctgctctt 
atctgggaaa 
ttgcatgacg 

gggggtcctg 

aatgctttcg 
agcgcagcag 
tcgcacacct 
ccggggttcg 
ggaagagtgc 
cgcctcaccg 
aaccgcgtcc 
aggtacatct 
accctacgct 
ctggtlgcgga 
accgagctct 
tacaccatca 
ctcacgcagg 
cagccgatga 
aactccgccg 
gaagtgccgg 
gaagcctggg 
cttctgtcga 
tccgccagaa 
tgcgactgtg 
tcgctcaggt 
gccgtatccc 
gtcttccaac 
cgcggggaga 
gtcccccctc 



gtctcagtca 
ggcacgttgg 
ccctgctgct 
tccaggttgg 
tagaccaggc 
gccagcacct 
gagagccggt 
atgatgcggt 
tgcagcgccg 
ccgcttagta 
cgccgacgtt 
ccagggcgcg 
cctctctatc 
ggtcgctgag 
aaacgcccag 
cgatctttac 
acttgcccag 
tgcgctcctg 
cgggctctac 
ctataaggaa 
caaagtccca 
cgtggttgtt 
ttaccaggtc 
gctcaaagtc 
ctgcgtggaa 
ggttctgccg 
gctcaggaga 
gccgggcccg 
ttgcccctcc 
gctcttgccg 
agcaaaaaag 
gggaggcaaa 
cgactcaacc 
ctttccagcc 
ccgccgcgcc 
gggttcgaca 
aaccccggtc 
ccgcttacag 
gccagagcgt 
gcgactttcc 
acggatatca 
ggcccctcta 
cttgcaacga 
cgcagccggg 
acacgggcgc 
tgcagcaggc 
ggggcttcgg 
ccgccgtagc 
tagaaaggct 
gcatggccga 
ccatccgccg 
acaaccccga 
actggttaga 
ccctcggtgg 
tgccgcacgg 
tgcgcccccg 
tgatcgagcg 
ccccaccgcc 



23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
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gccagaagaa gaagaagaag gggaggccct 
ccctgtagcc tttgagcgcg aggtgcgcga 
ggaggagtta accgtgtcgg cgcgcaactc 
cgaggccatg gagcgccttg aggccttggg 
ggttatgtac ttcttcgtgg cagaacacac 
cctgcgaaac tacgccgtct tcgcccggca 
gcgcgcccgc gatgccgaag ggggcgtggt 
caacgccttc tcgcagctca tggcccgcat 
agccggacgc ggagatctcc aggaggaaga 
tcaagacaac tcaggagacg tgcaggagat 
aattgattct gtcgaactct ctttcaggtt 
gaggcgccag attcaggaga tcaaccgccg 
gcagcaccag ctcctgcccg cgcgcggcgc 
tcccgagccc cccctacctc cgggggcccg 
aaggacaccc ccgcggccca ccgcccgccg 
ggcctcttgc aagtcatcga cgccgccacc 
gacctagccc gcgccctgac ccggctatgc 
ctgtcgccgc gggagctcca gaccatggac 
ctccgaccgc cgcgcgcgga catctggact 
accataactc ccctcgagca gccagacggt 
cagccaaacc cgccaggcga ggggctcaaa 
caggtcaacc tcgtgcagga tgtacagccc 
tacaaaagcc agcacgagtg ttcggcccgt 
agccactcct ccaactggtg gcgggagatc 
accgagcgtc tctttgtcac ctacgatgta 
aagcagctcg tgcccttcat gctggttatg 
gccgcgcgag acctagccgt ggaccttgga 
ttctactgca tcaccccaga aaaaatggcc 
cacctgcaaa tgctaatggc ccgtgacctg 
cttgcagact gggccctgtc agaacacggg 
gaacttaaaa aattgccctc catcaagggc 
ggccacaaca tcaacggctt cgacgagatc 
tccgaggtgc cgggaccctt ccgcatcaca 
cttttcaacg atgtcacctt cgccctgcca 
ttgctctggg agcagggcgg atgcgacgac 
atggttaggg acacctttgc gctcacccac 
gcgctacccg tagaaaaggg atgctgcgcc 
ggctcttacc gfctcggaggc cgacgggttt 
gagtttgtcc tcaaccgcga gctgtggaaa 
aaggaaaccc tggactactg cgccctagac 
ctgcgcgact cctacgcctc cttcgtgcgt 
aacgtcttcc agcgtccaac catatcatcc 
ttccgagcag agcagcccgc ccgtagcaac 
gaactatacg attacgtgcg cgccagcatc 
ggaatactca gagagcccct ctacgtttac 
acccacccca tgccatgggg tcccccactc 
gcatggcagc aggcgctaga cttgcaagga 
ctgcccgggg tctttaccgt ggacgcagac 
ccgccattct gttcgcgcaa gggcggccgc 
gaggtagcca ccagcgttga ccttgtcacc 
gtgcccgacg agcgcaccac cgtctttccc 
cagctaaaca tcgcggccaa ggagcgcgcc 
atcgccaagt tgctgtccaa cgccctctac 
aagattgtct tttctgacca gatggacgcg 
gtgaatatca aatcctcctc gtttttggaa 
gcttttgaga gggagtactc accccaacag 
agtgaggacg aacgcgcccc cacccccttt 
gtggcctaca cctataaacc aatcaccttc 



tatggaagag gagattgaag aagaagaggc 26640 
cactgtcgcc gagctcatcc gtcttctgga 26700 
ccagtttttc aacttcgccg tggacttcta 26760 
ggatatcaac gaatccacgt tgcgacgctg 2 6820 
cgccaccacc ctcaactacc tctttcagcg 26880 
cgtggagctc aatctcgcgc aggtggtcat 2 6940 
ctacagccgc gtctggaacg agggaggcct 27 000 
ctccaacgac ctcgccgcca ccgtggagcg 27060 
gatcgagcag ttcatggccg aaatcgccta 27120 
tttgcgccag gccgccgfcca acgacaccga 27180 
caagctcacc gggcccgtcg tcttcacgca 27240 
cgtcgtcgcg ttcgccagca acctccgcgc 273 00 
cgacgtgccc ctgccccctc tcccggcggg 27360 
cccgcgtcac cgcttttaga tgcatcatcc 27420 
cgcggtaccg tagtcgcgcc gcggggatgc 27480 
aaccagcccc tggaaatcag gtatcacctg 27540 
gaggtaaacc tgcaggagct cccgcctgac 27600 
agctcccatc tgcgcgatgt tgtcatcaag 27660 
ttgggctcgc gcggcgtggt ggtccgatcc 27720 
caaggacaag cagccgaagt agaagaccac 27780 
ttcccactct gcttccttgt gcgcggtcgt 27840 
gtgcaccgct gccagtactg cgcacgtttt 27900 
cgcagggact tctactttca ccacatcaac 27960 
cagttcttcc cgatcggctc gcatcctcgc 28020 
gagacctata cttggatggg ggcctttggg 28080 
aagttcggcg gagatgagcc tctggtgacc 2 8140 
tgggaccgct gggaacaaga cccgcttacc 28200 
ataggtcgcc agtttaggac ctttcgcgac 28260 
tggagctcat tcgtcgcttc caaccctcat 28320 
ctcagctccc ctgaggagct cacctacgag 28380 
accccgcgct tcttggaact ttacatcgtg 28440 
gtgctcgccg cccaggtaat taacaaccgt 2 8500 
cgcaacttta tgcctcgcgc gggaaagata 2 856 0 
aacccgcgtt ccaaaaagcg cacggacttt 2 862 0 
actgacttca aataccagta cctcaaagtc 28680 
acctcgctcc ggaaggccgc gcaggcatac 28740 
taccaggccg tcaaccagtt ctacatgcta 28800 
ccgatccaag agtactggaa agaccgcgaa 28860 
aaaaagggac aggataagta tgacatcatc 2892 0 
gtgcaggtca ccgccgagct ggtcaacaag 2 8980 
gacgcggtag gtctcacaga cgccagcttc 29040 
aactcacatg ccatcttcag gcagatagtc 29100 
ctcggtcccg acctcctcgc tccctcgcac 29160 
cgcggtggaa gatgctaccc tacatatctt 2 922 0 
gacatttgcg gcatgtacgc ctccgcgctc 2 928 0 
aacccatacg agcgcgcgct tgccgcccgc 29340 
tgcaagatag actacttcga cgcgcgcctg 29400 
cccccggacg agacgcagct agacccacta 29460 
ctctgctgga ccaacgagcg cctacgcgga 2952 0 
ctgcacaacc gcggttggcg cgtgcacctg 29580 
gaatggcggt gcgttgcgcg cgaatacgtg 29640 
gatcgcgaca aaaaccaaac cctgcgctcc 29700 
gggtcgtttg ccaccaagct tgacaacaaa 29760 
gccaccctca aaggcatcac cgcgggccag 29820 
actgacaatc ttagcgcaga agtcatgccc 29880 
ctggccctcg cagacagcga tgcggaagag 29940 
tatagccccc cttcaggaac acccggtcac 30000 
cttgatgccg aagagggcga catgtgtctt 30060 
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cacaccctgg 
tccttcgtgc 
gaggaccgcg 
agccttttcg 
aaaaagcatg 
gaatgcgaga 
ctcgcgccca 
aagggcaagc 
aaatgctacc 
agcctcaagc 
actacgctga 
caccgactac 
atcgagatgc 
cgctcaaaag 
aagaactgct 
aagtcaggga 
ctcttaacta 
agtcgcagct 
ttttcttcat 
tgcaaatctg 
gcaccctccg 
atgacgttag 
tcattatgga 
acgcatttcc 
tggttctgca 
tacagtccaa 
taaacactta 
ggcaccacgc 
aagctctgca 
tccagagtca 
gctggtCGcg 
tgatcaaaat 
ggaagccttc 
aagttcccgg 
ggatgaacgg 
cccgcgggcg 
tccagacacg 
aatgctggag 
atatgagctc 
acagatacaa 
cccatcgatg 
gtattcagtg 
caattaatac 
acgttgtgat 
aggttacggg 
cgtttccgtt 
aggaaacgac 
ttgtccgtgg 
aaggccagga 
gacgagcatc 
agataccagg 
cttaccggat 
cgctgtaggt 
ccccccgttc 
gtaagacacg 
tatgtaggcg 
acagtatttg 
tcttgatccg 



agcgagtgga 
tggcctggac 
gaacaccgct 
tcaccgagcg 
ggggaaacct 
ccgtctgcgg 
agctctacgc 
tgcgcgccaa 
tggccgacgc 
gcacGctggc 
cgaggaccct 
tgccgtacag 
cgtagagcac 
catgcctacg 
atcgctgggc 
catgcttaac 
ccagttgcag 
gctcaggaac 
cgccccgcag 
tgagggtaac 
cccgcgcttt 
tgatcccaga 
cgaatgcatg 
ttctaagcta 
caacatgaat 
gatgcatctc 
caccaagggc 
ccagcgctcc 
gtggtgctac 
cctttaccac 
ggcctaccgc 
ccaaacagag 
agggcagaaa 
gtcaaagaat 
gaagctgcac 
gtggctgcag 
gtctcgtagg 
cccatcacat 
acaatgcttc 
aactacatga 
gcaaacagct 
tcgctgattt 
gatacctgcg 
atgtagatga 
gcggcgacct 
cttcttcgtc 
aggtgctgaa 
aatgaacaat 
accgtaaaaa 
acaaaaatca 
cgtttccccc 
acctgtccgc 
atctcagttc 
agcccgaccg 
acttatcgcc 
gtgctacaga 
gtatctgcgc 
gcaaacaaac 



ccccctagtg 
gcgagccttc 
cgaggacagg 
tggacaccgg 
ggtttttgac 
ggcctgcggc 
ccttaaaagt 
gggccacgcc 
gcagggcgaa 
cagcgcgcag 
gcgcccgtgg 
cgaaagccgc 
gtgaccgagc 
gcggacggcc 
ggcgagcgcc 
gaagtggccc 
ccggtaatag 
ctgctttctt 
gtagacatga 
tacgcccctg 
gtaaaaatgg 
aatatcttcg 
gaaaatctcg 
catgacaaat 
ccccggaggg 
atatccccac 
ctgcGCctgg 
tgctacgact 
ctccacccca 
gtcctggaaa 
gcgcgcaaaa 
tctggttttt 
cctgctggcg 
ccaattgtgc 
tgcttgcaag 
cggctgaagc 
tcaaggtagt 
tctgacgcac 
catcaaacga 
gacccccacc 
attatgggfca 
gtattgtctg 
tcataattga 
taatcattat 
cgcgggtttt 
ataacttaat 
agcgaggctt 
ggaagttaac 
ggccgcgttg 
acgctcaagt 
tggaagctcc 
ctttctccct 
ggtgtaggtc 
ctgcgcctta 
actggcagca 
gttcttgaag 
tctgccaaag 
caccgctggt 



gacaacgacc 
gtctcagagt 
cctctcaagt 
ctcatggaaa 
cccgaacggc 
gcggatgcct 
ctgcactgcc 
gcggaggggc 
gaccggcagc 
cccggagcgc 
aaagacatga 
cccaacccgc 
tgtgggaccg 
tcaaaccgtt 
ttctggcgca 
ccctgctcag 
gtgtgattta 
cccagctgat 
tccccccatc 
ggccggatgg 
cctatgacga 
cccaggccgc 
gaggtcacaa 
ttcccaagtg 
atatggctgg 
gtatgcaccc 
caatcagctt 
ggatcatcta 
gagacgggct 
aaatacacag 
cccctaaata 
atttatgttt 
cagatccaac 
caaaagagcc 
cgggctcagg 
ggcggcggag 
agagtttgcg 
cccggcccat 
gttggcgctc 
ttatatattc 
ttatgggtgc 
aagttgtttt 
ttatttgacg 
cactttacgg 
cgctatttat 
gtttttattt 
tttggcctct 
ggatccaggc 
ctggcgtttt 
cagaggtggc 
ctcgtgcgct 
tcgggaagcg 
gttcgctcca 
tccggtaact 
gccactggta 
tggtggccta 
ccagttacct 
agcggtggtt 



gctacccctc 
ggtccgagtt 
ctgtatacgg 
ccagaggtaa 
cagagctcac 
actccccgga 
cctcgtgcgg 
tggactatga 
gcttcagcac 
accccttcac 
ccctggcccg 
gaaacgagga 
cctggaactg 
gaaaaacttt 
tttggtcagg 
ggatgacggc 
cgggcccacc 
ctcccctacG 
tgaacfccaaa 
aaccattata 
tctcatcctg 
cgcccgtggg 
gggcgtctcc 
caccggatac 
gaacatagcc 
atcccagctt 
gc'tactgaaa 
caacaccacG 
tatgcccatg 
gaccctcaac 
aagacagcaa 
taaaccgcat 
agctgctgag 
gtcaacttgt 
aaagcaaagt 
gctgcagtct 
ggcaggacgg 

gggggcatgc 

atggcggcgg 
tttcccaccc 

tagcgacatg 
tacgttaagt 
tggtttgatg 
gtcctttccg 
gaaaattttc 
aaaataccct 
gtcgtttcct 
cgcgagcaaa 
tccataggct 
gaaacGcgac 
ctcctgttcc 
tggcgctttc 
agctgggctg 
atcgtcttga 
acaggattag 
actacggcta 
tcggaaaaag 
tttttgtttg 



ccacttagcc 
tctatacgag 
ggacacggac 
g-aaacgcatc 
ctggctcgtg 
atcggtattt 
cgcctcctcc 
caccatggtc 
cagcaggacc 
cgtgacccag 
tctggacgag 
gatatgctgg 
cttggtcaaa 
gcttccttgc 
gaaaacatgc 
agctgcagct 
ggctgcggta 
ccggaaacgg 
gcgtgggaaa 
ccgcagtctg 
gaacacaact 
cccattgcca 
aagttcttcc 
actgtgctgg 
aacctaaaaa 
aaccgctttg 
gacattttta 
ccgcag-catg 
tatctgaaca 
gaccgagacc 
gacacttgct 
tgggagggga 
aaacgacatt 
catcgcgggc 
cagtcacaat 
ccaacggcgt 
ggcgaccatc 
gcgttgfccaa 
ctgctgcaaa 
ttaagccGcg 
aggttgccGc 
tgatgcagat 
gcctccacgc 
gtgatccgac 
cggtttaagg 
ctgaaaagaa 
ttctctgttt 
aggccagcaa 
ccgcccccct 
aggactataa 
gaccctgccg 
tcatagctca 
tgtgcacgaa 
gtccaacccg 
cagagcgagg 
cactagaaga 
agttggtagc 
caagcagcag 



30120 
30180 

30240 
30300 
30360 
30420 
30480 
30540 
30600 
30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
33540 
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attacgcgca 
gctcagtgga 
ttcacctaga 
taaacttggt 
ctatttcgtt 
ggcttaccat 
gatttatcag 
ttatccgcct 
gttaatagtt 
tttggtatgg 
atgttgtgca 
gccgcagtgt 
tccgtaagat 
gccacatagc 
ctcaaggatc 
atcttctgca 
tgccgcaaaa 
tcaatattat 



gaaaaaaagg 
acgaaaactc 
tccttttaaa 
ctgacagtta 
catccatagt 
ccggccccag 
caataaacca 
ccatccagtc 
ttcgcaacgt 
cttcattcag 
aaaaagcggt 
tatcactcat 
gcttttctgt 
agaactttaa 
ttaccgctgt 
tcttttactt 
aagggaataa 
tgaagcattt 



atctcaagaa 
acgttaaggg 
ttaaaaatga 
ccaatgctta 
tgcctgactc 
tgctgcaatg 
gccagccgga 
tattagttgt 
tgttgccatt 
ctccggttcc 
tagctccttc 
ggttatggca 
gactggtgag 
aagtgctcat 
tgagatccag 
tcaccagcgt 
gggcgacacg 
atcagggtta 



gatcctttga 
attttggtca 
agttttaaat 
atcagtgagg 
cccgtagtgt 
ataccgcgtg 
agtgccgagc 
tgccgggaag 
gctacaggca 
caacgatcaa 
ggtcctccga 
gcactgcata 
tattcaacca 
cattgggaaa 
ttcgatgtaa 
ttctgggtga 
gaaatgttga 
ttgtctcatc 



tcttttctac 
tcagattatc 
caatctaaag 
cacctatctc 
agataactac 
acccacgctc 
gcagaagtgg 
ctagagtaag 
tcgtggtgtc 
ggcgagttac 
tagttgtcag 
attctcttac 
agaatacggg 
cgttcttcgg 
cccactcgcg 
gcaaaaacag 
atactcatac 
agcggataca 



ggggtctgac 
aaaaaggatc 
tatatatgag 
agcgatctgt 
gatacgggag 
accggctcct 
tcctgcaact 
tagttcgcca 
acgctcgtcg 
atgatccccc 
aagtaagttg 
tgtcatgcca 
ataataccgc 
ggcgaaaact 
cacccaagtg 
gaaggcaaaa 
ttttcctttt 
tatttg 



33600 
33660 
33720 
33780 
33840 
33900 
33960 
34020 
34080 
34140 
34200 
34260 
34320 
34380 
34440 
34500 
34560 
34616 



<210> 3 
<211> 31672 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Adenovirus 



<400> 3 

gaatcggcca 

cttattttgg 

gcgtgggaac 

ttgtagtttt 

gaagttgtgg 

gttttttgtg 

ttggcccttt 

aggttttttt 

ttctggagcg 

ttttctctcc 

tgcgggtatg 

aacctgatgt 

tggtgctttt 

gtcttatgct 

ttttttttgt 

aatggtgtct 

ctacgatgtg 

ttttatatcg 

gagtatgcgt 

cgcgctggtc 

ggatcgcggt 

tgaatttttg 

atttttacaa 

cgagatgaaa 

cctgaagggt 

attgtgcaac 

accggagggg 

gaataaaaaa 



gcgcgaatta 
attgaagcca 
ggggcgggtg 
tttaaaatgg 
gttttttggc 
gactttaacc 
ttacactgtg 
actggtaagg 
ggagggtgct 
tattaatttt 
tattcccccg 
gtttaccgag 
taatcacggt 
tataagggtt 
tattttattt 
ttttctgtgg 
cttgcttttt 
ccgcccatgc 
gtcataatca 
cgtgcagacc 
atttttgtta 
caatcatgat 
tggccggact 
attattt-ggg 
ttagccttta 
atcttacaaa 
agcgcgttca 
aaaaaaaaca 



actataacgg 
atatgataat 
acgtaggttt 
gaagttacgt 
tttcgtttct 
gttacgtcat 
actgattgag 
ctgactgtta 
attttgccta 
gttatacctc 
ggctatttcg 
tcttacatta 
gaccagtttt 
gtttttcctg 
tgtgtttaat 
tggttccgga 
tgcgcgaggc 
aacaagctta 
gtgtgggttc 
tgcacgatta 
atgttccgct 
tcgctgcttg 
taatattcgg 
catggttgaa 
cgtccacttg 
tgccattatc 
cttaafcagat 
tggttcttcc 



tcctaaggta 
gagggggtgg 
tagggcggag 
atcgtgggaa 
gggcgtaggt 
tttttagtcc 
ctggtgccgt 
tggctgccgc 
ggcaggaggg 
ctatgggggc 
gtcgcttttt 
tgactccgga 
tttacggtca 
ttgtaagaca 
gcaggaaccc 
acttacctgc 
tttgcctgat 
cataggggct 
ttttgtcatg 
tgttcagctg 
tttgaatctt 
aggctgaagg 
gatttgctta 
ggtgctggaa 
gacgtgaggg 
tgttctttgg 
Gttcattttg 
agctcttccc 



gcgtcatcat 
agtttgtgac 
taacttgcat 
aacggaagtg 
tcgcgtgcgg 
tatatatact 
gtcgagtggt 
tgtggaagcg 
tttttcaggt 
tgtaatgttg 
agcactgacc 
catgaccgag 
cgccggcatg 
ggcttctaat 
gcagacatgt 
ctttatctgc 
tttttgagca 
acgctggtta 
gttcctggcg 
gccctgcgaa 
atacaggtct 
tggagggcgc 
gagacatatt 
tgtttataga 
cagtttgcct 
ctgtagagtt 
aggttttgga 
gctcctcccg 



cataatatac 
gtggcgcggg 
gtattgggaa 
aagatttgag 
ttttctgggt 
cgctctgtac 
gttttttaat 
ctgtatgttg 
gtttatgtgt 
tctctacgcc 
gatgttaacc 
gaactgtcgg 
gccgtagtcc 
gtttaaatgt 
ttgagagaaa 
atgagcatga 
gcaccttgca 
gcatagctcc 
gggaagtggc 
gggacctacg 
gtgaggaacc 
tctggagcag 
gataaggtgg 
ggagattcac 
tttggaagcc 
tgaccacgcc 
taatcttttg 
tgtgtgactc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 
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gcagaacgaa 
ggcagcggcg 
gagagagtgg 
cagatctgtt 
ttccatttgg 
gtagggatcg 
cgctgctgcc 
cctgcagtgt 
cgcgggagga 
ttgatatcat 
gttccagtcc 
ggatggtggt 
attacaacat 
taatctacct 
ttggatacag 
actgtgctga 
tgctgcgggc 
cggagcggcg 
tgcacgatta 
aaccgcaagt 
agctgcccgg 
ggaatataac 
ggggagaaag 
ttctgtgagt 
tactgaatga 
acatgcaaca 
gtagcaaaag 
taagtgctta 
atcataggtt 
gtttgacttt 
tatggatfctg 
gagttctcca 
gtgcctgtca 
tgactcccac 
cagttaggag 
attgtaatgg 
ccgttttcat 
gactcagatg 
ttatcaaagt 
gatgcattaa 
atatacaggg 
tccacatcta 
atgttgtttg 
tcaacggtga 
aaaggaccgc 
ttaatgccca 
tcgctgtcac 
gtaatgggcc 
gactgtacgc 
gcgcctgagg 
ttaagtggct 
gtaagaccgc 
acgcgtagag 
ggggtaagaa 
aaggtgtctt 
gagggacaag 
aaaaggcggc 
gacaaagatt 



tgtgtaggtt 
catgaaggag 
atatactaca 
tgtcacgccc 
catgacacta 
cctacctcct 
cgaatgtaac 
gggatttacg 
gcttgtaatc 
gacgagcatg 
cggttccctg 
ggatggcgcc 
gccaaaagag 
gcgcttgtgg 
cgccttgcac 
tttaagtgag 
ggtgcgaatc 
gcggcagcag 
tgactctacc 
tggacagcag 
ggagtttatt 
acctaagaat 
gactgtgtac 
ttgattaagg 
aaaatgactt 
ggttcacgat 
tttcagtggt 
cctcgctagt 
tagttttatc 
gggtttttgg 
cattagttga 
ttagaacacc 
tggatgaaag 
attttgtaag 
atgggtctgg 
cccctgagtt 
tgtaatcaat 
tgtttgtatc 
ttagtccact 
aaaggtatag 
gtccctgccc 
gaattaacaa 
atgaatcata 
cacctggtcc 
ttattttaat 
agctacccgt 
tgccagagag 
ctttagtagc 
taagagcgcc 
taattgtaag 
gagtaacagt 
tgcccatttt 
agagaactcc 
agggcacagt 
cagacggtct 
aacatgagga 
tgagatacca 
tgctgactga 



ggctgggtgt 
tttacataga 
actactacac 
gcacctggtt 
cgaccaacac 
tttgagacag 
actttgacaa 
ctgattcagg 
ctgaggaagt 
atgatccatg 
cagtgcatag 
atgtttaatc 
gtaatgttta 
tatgatggcc 
tgtgggattt 
atcagggtgc 
atcgctgagg 
tttattcgcg 
cccatgtagg 
cctgtggctc 
aatatcactg 
atgtctgtta 
tctgtgtgtt 
tacggtgatc 
gaaattttct 
tctttattcc 
gtattttcca 
ttctgtggat 
accatgcaag 
ataggctaga 
gttcccattt 
gttttggtca 
atctccagat 
aaccaaagta 
ggttgtccac 
gtcaaagctt 
gccagagcca 
aaactccaga 
ggattttttt 
gcctcfcgtta 
cagtttaaga 
gttgttattt 
accaatagct 
agtaactact 
tcctattttt 
ggcagtagtt 

gggggctgat 

aatgcttagt 
gctagtaact 
tggtgcggag 
ggttacattt 
aagcgcaagc 
agggggactt 
tggaggcccg 
ggcgcgtttc 
atttgacatc 
gagttgggag 
ttttaagtaa 



ggcttattct 
acccgaagcc 
agagcgagct 
ttgcttcagg 
gatctcggtt 
agacccgcgc 
tgcacaacgt 
aatgggttgt 
gtatgcacgt 
gttacgagtc 
ccggcgggca 
agaggtttat 
tgtccagcgt 
acgtgggttc 
tgaacaatat 
gctgctgtgc 
agaccactgc 
cgctgctgca 
cgtggacttc 
agcagctgga 
atgagcgttt 
cccatgatat 
gggagggagg 
aatataagct 
gcaattgaaa 
tgggcaatgt 
ctttcccagg 
tcactagtgc 
taaacttgac 
aggttaggca 
ctaaagttcc 
aatctaagga 
acagccaaag 
aatttgcagt 
agggtaagtt 
aaacccgctc 
attttagttt 
ccctttcctg 
atgctaactt 
tagtttatgt 
cgtagttttg 
atacgcatgc 
cctgcaactt 
gttagtgtat 
ccattattta 
agcgggggtg 
gtttgcaggg 
ttggagtctt 
atcagaggag 
gtgtccaaac 
tgggaggtga 
atgccgtggg 
tcttggaaac 
gtttctgtgt 
atctgcaaca 
ccatttaaac 
gaaggaaagg 
gtaatttatt 



gcggtggtgg 
agggggcgcc 
aagcgacgag 
aaatatgact 
gtctcggcgc 
taccatactg 
gagttacgtg 
tccctgggat 
gtgcctgtgt 
ctgggctctc 
ggttttggcc 
atggtaccgg 
gtttatgagg 
tgtggtcccc 
tgtggtgctg 
ccggaggaca 
catgttgtat 
gcaccaccgc 
cccttcgccg 
cagcgacatg 
ggctcgacag 
gatgcttttt 
tggcaggttg 
atgtggtggt 
aataaacacg 
aggagaaggt 
accatgtaaa 
cattaagtgt 
tgacaatgtt 
taaatccaac 
agtaatgttt 
atatactaac 
cagctacagt 
cattatctga 
tgtcatcatt 
caagtttagt 
ttattgggtt 
catttatagc 
ccagtttttt 
ccaagttatg 
tttgagcatc 
caccgcccgt 
tggttctaag 
cggagttttg 
cataaatagg 
atgcagttac 
ctagctttcc 
gcacggtcag 
cggtggttgc 
ttatgtttga 
ggtttccggc 
aggtgtccaa 
cattgggtga 
catatggata 
atatgaagat 
tttggagaaa 
a-ggtgafcgct 
cagtcgtagc 



atgttatcag 
tggatgcttt 
accggagacg 
acgtccggcg 
actccgtaca 
gaggatcatc 
cgaggtcttc 
atggttctga 
tgtgccaaca 
cactgtcatt 
agctggttta 
gaggtggtga 
ggtcgccact 
gccatgagct 
tgctgcagtt 
aggcgtctca 
tcctgcagga 
cctatcctga 
cccgttgagc 
aacttaagcg 
gaaaccgtgt 
aaggccagcc 
aatactaggg 
ggggctatac 
ttgaaacata 
gtaagagttg 
agacatagag 
aatggtaagt 
atttttagca 
tgcatttgtg 
tttaagtgag 
acttgcaacg 
agctagtact 
atgaattctg 
tttgtttcct 
aatcatggca 
gatatctgga 
tatggcagta 
agtattgttt 
agatgcatta 
aaatgggtaa 
tttaatttCG 
ggagttttgt 
tgctacttgc 
atcttccatg 
agtaagggtg 
atctgacact 
tggggcttgt 
cactgttagg 
ctttgttttt 
cttgtctagg 
aggttcggag 
aacaaatgga 
cacggggttg 
agtgggtgcg 
gtttgcagct 
gaataagctg 
cgtccgccga 



1740 
1800 

1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
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gtctttcacc gcgtcaaagt tgggaataaa 
aaaggggttg aagtaaaccg aaggcacgaa 
tccggagcgc ggctccgagg acgaggtctg 
tgaagagcgg ccagcgccgc cgatctgaaa 
gctcaccgac tcgtcgttga gctgaatacc 
ctgcccgggc gaccgcaccc tgtgacgaaa 
catctgaact tcggcctggg cgtctctggg 
ctggtacacG agggcagcgg gccaactacg 
aatagccgcc tgttcgagga gaattcggtt 
tatcatgtgg ggtcccgcgc tcatgtagtt 
agccgcaagt cccatttgtg gctggtaact 
cataatggcg ctgacgacag gtgctggcgc 
tcgcgcttaa atttgagaaa gggcgcgaaa 
gctgaagaga gccfcccgcgt cttccagcgt 
caggcagctg cgggtgaggg agcgcagaga 
gcccctgctt tgttgaaata tagcatacag 
ggtcgatacg ggttcgttgg gcgccagacg 
tgtggatttc ttgggctttg tcagagtctt 
cgctgctgtt gctgccgctg ccgctgccgc 
agtaatgcag gatgttacgg gggaaggcca 
cgaaggagat gttgccccca cagtcttgca 
gccacgagcg gtagccttgg cgctgttgtt 
acttaccggc cctggttcca gtggtgtccc 
ccggcggcgc ctgaggagcg gaggttgtag 
gcgccggcga ggggaatgcg accgagggtg 
cggaagcttc gtctaggctg tcccagtctt 
cctctgcctg actgtcccag tattcctcct 
gcttcttttt gggtgccatc ctgggaagca 
ggcgggggga ttgggttgag ctcctcgccg 
tttcgtagca gaaactcttg gcgggctttg 
ccctgggtaa tgacgcaggc ggtaagctcc 
aacctaatct cgtgggcgtg gtagtcctca 
cacagccccg gagtgagttt caaccccgga 
tgcagctcaa aggtaccgat aatttgactt 
gagcggtgcg gggtgcatag gttgcagcga 
acgtctfccca tgatgtcgga gtggtaggca 
tgaccccaaa gcggcggagg gcattcacgg 
gcacagcagg tggcgggcag aattcctgaa 
aacatgcttt gactggtgaa gtctggcaga 
gggaagataa tgtccgccag gtgcgcggcc 
tccttcaagt tttgctttag cagcttctgc 
tgctgccaca cgcccatggc cgtttgccag 
cggacgtagt cgcggcgcgc ctcgcccttg 
cggttttcgt gcaaaattcc aaggtaggag 
ttgcaggcct ggcgcacgta gccctggcga 
cgctgcatct ccgggtcagc aaagaaccgc 
actgcggcca tcattagctt gcgtcgctcc 
cagcgcgcca gctgctcatc gccaactgcg 
tttgcatccc tctccagggg tcgtgcacgg 
ataaccttgg ggggtaggtt aagtgccggg 
ttcagcacgg ctaggcgcgc gttgtcaccc 
tcattttcgc tgttttcttg ttgcagagcg 
ccctcaaaga tttttggcac ttcgtcgagc 
cgcaaggcca gctgcttgtc cgctcggctg 
cagttttgga aaaagatgtg ataggtggca 
aagttgaggc gcgggttggg ctcgcatgtg 
ggtgagaaca ggtggcgttc gtaggcaagg 
ctgcgctctt gcaacgcgtc gcagataatg 



ctggtccggg tagtggccgg gaggtccaga 5220 
ctcctcaata aattgtagag ttccaafcgcc 5280 
cagagttagg atcgcctgac ggggcgtaaa 534 0 
tgtcccgtcc ggacggagac caagagagga 5400 
tcgccctctg attttcaggt gagttatacc 5460 
gccgcccgca agctgcgccc ctgagttagt 552 0 
aagtaccaca gtggtgggag cgggactttc 5580 
gggattaagg ttattacgag gtgtggtggt 5640 
tcggtgggcg cggattccgt tgacccggga 5700 
tattcgggtt gagtagtctt gggcagctcc 5760 
ccacatgtag ggcgtgggaa tttccttgct 5820 
cgggtgtggc cgctggagat gacgtagttt 5880 
ctagtcctta agagtcagcg cgcagtattt 5940 
gcgccgaagc tgatcttcgc ttttgtgata 6000 
cctgtttttt attttcagct cttgttcttg 6060 
agtgggaaaa atcctatttc taagctcgcg 6120 
cagcgctcct cctcctgctg ctgccgccgc 6180 
gctatccggt cgcctttgct tctgtgtgac 6240 
cggtgcagta ggggctgtag agatgacggt 6300 
cgccgtgatg gtagagaaga aagcggcggg 6360 
agcaagcaac tatggcgttc ttgtgcccgc 6420 
gctcttgggc taacggcggc ggctgcttag 6480 
atctacggtt gggtcggcga acaggcagtg 6540 
cgatgctggg aacggttgcc aatttctggg 6600 
acggtgtttc gtctgacacc tcttcggcct 6660 
ccatcatctc ctcctcctcg tccaaaacct 6720 
cgtccgtggg tggcggcggc ggcagctgca 6780 
agggcccgcg gctgctgata gggctgcggc 6840 
gactgggggt ccaggtaaac cccccgtccc 6900 
ttgatggctt gcaattggcc aaggatgtgg 6960 
gcatttggcg ggcgggattg gtcttcgtag 7 020 
ggtacaaatt tgcgaaggta agccgacgtc 7080 
gccgcggact tttcgtcagg cgagggaccc 7140 
tcgctaagca gttgcgaatt gcagaccagg 72 0 0 
cagtgacact ccagtaggcc gtcaccgctc 72 60 
aggtagttgg ctagctgcag aaggtagcag 732 0 
tacttaatgg gcacaaagtc gctaggaagc 73 80 
cgctctagga taaagttcct aaagttttgc 7440 
ccctgttgca gggttttaag caggcgttcg 7500 
acggagcgct cgttgaaggc cgtccatagg 7560 
agctccttta ggttgcgctc ctccaggcat 7620 
gtgtagcaca gaaataagta aacgcagtcg 7680 
agcgtggaat gaagcacgtt ttgcccgagg 7740 
accaggttgc agagctccac gttggaaatt 780 0 
aaggtgtagt gcaacgtttc ctctagcttg 7860 
tgcatgcact caagctccac ggtaacaagc 7920 
tccaagtcgg caggctcgcg cgtctcaagc 7980 
ggtaggccct cctcggtttg ttcttgcaag 8040 
cgcacgatca gctcgctcat gactgtgctc 8100 
taggcaaagt gggtgacctc gatgctgcgt 8160 
tcaagttcca ccagcactcc acagtgactt 8220 
tttgccgcgc gtttctcgtc gcgtccaaga 82 80 
gaggcgatat caggtatgac agcgccctgc 83 40 
cggttggcac ggcaggatag gggtatcttg 8400 
agcacctctg gcacggcaaa tacggggtag 8460 
ccgttttctt; ggcgtttggg gggtacgcgc 8520 
ctgacatccg ctatggcgag gggcacatcg 8580 
gcgcactggc gctgcagatg cttcaacagc 8640 
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acgtcgtctc 
tcgtttgcct 

tcgtcgtctt 

gggggtgcct 

gcgaactcaa 
tgcctatagg 
cgcggacgcg 
tcgccgccgc 
gacgaagaag 
acctcgacgg 
cccgagcgcc 
gcgctacaaa 
aagcgcacgg 
gaggaaaagg 
ctgccgatcg 
taccacgtgg 
ctggcggccg 
accagcaaca 
tttgcagagg 
tgcgctgaga 
cacgtgattg 
agcaaggcca 
gacgcaaggt 
tgcggcatgt 
ttcatgcagg 
cggtgcgagt 
ttgactccgt 
agcgtgctgg 
tatcgcaact 
gacctgctaa 
ccgcggatgg 
ctgccagtgg 
gcaagggtgg 
gaaagtgtct 
tcctaatctg 
gctgttcctg 
gtaccccggt 
caaaggcgtg 
cgc tttgggt 
ggcgcaggag 
caaagggctc 
gtacgccccc 
aaacaaagcc 
ctttgacaat 
ttgcttgctt 
tctcgatgac 
cataaagaag 
cggagttggc 
tactggagaa 
tgtcaaccgc 
cgcgcatggt 
caacctgctg 
agtttctgaa 
agccaatatt 
gagctacgtt 
gcaggcggtc 
aggtgtgatt 
aagggtcgta 



ccacatctag 
ctgcgtcgtc 
cgcttacaaa 
cgacggggaa 
agggggcggt 
agaaggaaat 
gtgcggcgcg 
ctccccgcgc 
actcgtcaca 
cggatttggc 
cgccatcccc 
tggtgggttt 
tgcggcggct 
aagagtccag 
tgtctgcgtg 
ataacgatct 
tatgcaagac 
agacctttgt 
taacctacaa 
tcgaaggcga 
aaatggatgt 
agatcgtgaa 
gctgcgtgca 
tcttctctga 
cgctgtatcc 
gcaactcaaa 
tcgccctgag 
ccagcgtgca 
cgcgcgcgca 
acgcgttggt 
ttgtgcctga 
cgcatagcga 
ggggtaaata 
cctagtacat 
cgcactgtgg 
gttgcgacgc 
aataaggttc 
gagaaacatg 
ggacttttcc 
tgactcgtac 
aaagaggtag 
agtctcgcga 
tggaaagcgc 
ggctttcagt 
cttttatgtt 
gccgcggtgc 
ggtgggctcg 
gtagagaagg 
tgggatgcgc 
ggttttgcct 
gggagcgagg 
atactccttg 
gaacgagtac 
gtagttggcc 
gtagccctcc 
gttgcccggc 
aagatagaag 
gcctgatccc 



gtagtcgcca 
ctggtcttgc 
acctgggtcc 
ggtggtaggc 
taggctgtcc 
ggccagtcgg 
acgtccacca 
gcccccaaaa 
agatgcgctg 
cattgcgtcc 
agaggtgatc 
cagcaacGca 
gaatgaagac 
tgaagcggaa 
ggagaagggc 
aaaggcaaac 
ctggctaaac 
gacgatgatg 
gcaccacgag 
gcttaagtgt 
gacgagcgaa 
gaaccggtgg 
tgacgcggcc 
aggcgcaaag 
taacgcccag 
gcctgggcat 
caacgcggag 
ccacccggcg 
gggcggaggc 
gatggtgcgc 
gtttaagtgg 
tgcgcggcag 
atcacccgag 
tatttttaca 
ctgcggaagt 
agggtgggct 
atggtggggt 
cagcagaata 
agcgttatac 
tcaaactggt 
catgtttttg 
ccggccgtat 
ttgtcatagg 
tcctgctcac 
gtggcgttgc 
ggctggtgca 
tccatgggat 
ttttggccca 
caaagggtgc 
attagtgggt 
tagcctacga 
tatttagtat 
atgcggtcct 
aacatctgca 
ccgtcaactg 
cagctaacag 
gttccgtcaa 
agggaagggg 



tgcctttggt 
tttttatcct 
tgctcgataa 
gcgttggcgg 
tccttctcga 
gaagaggagc 
accatggagg 
aagcggctga 
gtgccgcgca 
aaaaagaaaa 
gtggacagcg 
ccggtgctaa 
gacccagtgg 
agtgaaagca 
atggaggctg 
ttcaagctac 
gaggagcacc 
ggsrcgattcc 
cccacgggct 
ctacacggga 
aacgggcagc 
ggccgaaatg 
tgtccggcca 
gctcaggtgg 
accgggcacg 
gcaccctttt 
gacctggacg 
ctgatagtgt 
cccaactgcg 
agcctgtgga 
agcactaaac 
aacccctttg 
agtgtacaaa 
tgtttttcaa 
agggcgagtg 
gtacctgggg 
tgtgatccat 
gtccacaggc 
agcggtcggg 
aaacctgctt 
agcgcgggtt 
tgactatggc 
tgcccaaaaa 
tggagcccat 
cggccgagaa 
cacggaccac 
ccacctcaaa 
ggtctgtgag 
gatcgcaaag 
agggcacgtt 
atcctgagtt 
cgtcaaccac 
tgtagctttc 
ccaggaacca 
agcgtttaat 
aagagtcaaa 
ggtatggtat 
tttcctttgt 



ccccccgccc 
ctgttggtac 
tcacttcctc 
catcggtgga 
ctgactccat 
agcgcgaaac 
acgtgtcgtc 
ggcggcgtct 
cacccagccc 
agaagcgccc 
aggaagaaag 
tcaagcacgg 
cgcggggtat 
cggtgataaa 
cgcgcgcgtt 
tgccfcgacca 
gcgggttgca 
tgcaggcgta 
gcgcgttgtg 
gcattatgat 
gcgcgctgaa 
tggtgcagat 
atcagttttc 
cttttaagca 
gtcaccttct 
tgggaaggca 
cggatctgat 
tccagtgctg 
acttcaagat 
gtgaaaactt 
accagtatcg 
atttttaaac 
taaaaacatt 
gtgacaaaaa 
gcgctccagg 
actgttaagc 
gggagtttgg 
ggccgagttg 
ggaagaagca 
gagtcgttgg 
ccaggcaaag 
gcaggcgagc 
afcatggccca 
ggcggcagct 
gggcgtgcgc 
gtcaaagact 
agtcatgtct 
tgcgcccatg 
aaactttttc 
ggcggggtaa 
gttatgctgg 
ttgccggctc 
tggaatgtag 
gtccttggtc 
ctcaaactca 
ggtaatggcc 
ggagccagag 
cttcaagcgt 



gacttgttcc 
tgagcgatcc 
ctcctcaagc 
ggcggtggtg 
gatctttttc 
cacccccgag 
cccgtcgccg 
cgagtccgag 
gcggccatcg 
ctctcccaag 
agaagatgtg 
caagggaggt 
gcggacgcaa 
cccgctgagc 
gatggacaag 
agtggaagct 
gctgaccttc 
cctgcagtcg 
gctgcaccgc 
aaataaggag 
ggagcagtct 
ctccaacacc 
cggcaagtct 
gatcaaggct 
gatgccacta 
gctaccaaag 
ctccgacaag 
caaccctgtg 
atcggcgccc 
caccgagctg 
caacgtgtcc 
ggcgcagacg 
tgcctttatt 
gaagtggcgc 
aagctgtaga 
atggagttgg 
ggccagttgg 
ggcccctgca 
atggcgctac 
tcagaaaagc 
gccatccagt 
ttgtgtggag 
caaccaagat 
gttgttgatg 
aggtacacgg 
tcaaacaaaa 
agcgcgtggg 
gacataaagt 
tgggtaatac 
gcctgtccct 
tgaagaattc 
atgggctgga 
aagccctggt 
atgttgcact 
ttgggagtaa 
accttcttaa 
taggtgtagt 
gtgaaggccc 



8700 
8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 
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aaccgcgaaa 
cgggtatggg 
tgtcatttct 
tggcgtaaag 
ccagagagct 
agcctggcag 
agggcacgta 
ggtggttaaa 
tgtagcagtc 
ggttgtcaga 
caatattgga 
agttgttacc 
tatctccatt 
ccccaatacc 
caatgattct 
tggttctatc 
gcaaatctac 
tgctgttata 
ttggcataga 
aagacagatg 
gtttagtagc 
caacctttgg 
gatttgtagg 
ctctccctcc 
ctggttgata 
ctatttgtag 
gtgttttctt 
catcttcctc 
cacaggagtt 
taaaagtagg 
gcacacggtt 
cgcggtccac 
gcgtaggtgc 
cgcgggcaaa 
gcatgtaaga 
cggctcagca 
ctatttagaa 
gtgtgctttg 
ggcgcaggga 
gtfctctgctg 
gcactccggt 
cctcgtacga 
gacgcttttc 
cgggagggcg 
ccaccacccc 
ctttcaactt 
tcttaatggt 
caccgctcat 
gtcgcggggc 
agcgagcgtg 
cagtgtctct 
gcgatcagtg 
cgccgccggt 
ctacggtgca 
atatctgcaa 
ggcagcacca 
acgggctggc 
gctcgtcttc 



tgctgcccag 
gtatagcatg 
gagcatggct 
acaaatgctg 
ctgcagaacc 
gaggaggagg 
gcggccgttt 
gggattaacg 
tacaagcccg 
tatttccaca 
gtaaaggaaa 
cactcctatt 
atcgcctgag 
cccaagagga 
aacatctgga 
acctatggaa 
cacggcattt 
a'tacattagg 
ttgttgaccc 
tgtgtctggg 
attgccttgc 
aagaggcacc 
cctggcataa 
tgccgcatta 
ggaaggatct 
cccgcttttt 
agtagcctga 
ttcttcatcc 
aggagcgccc 
ccccctgtcc 
gtcacccaca 
agggatgaac 
caccgtgggg 
ctgcaccagc 
ccactgcggc 
gctcctctgg 
gcatcgtcgg 
ccagttgcca 
cgcgcggcta 
ggtgtcagcg 
agccatgggc 
gggaggctca 
gccacgcccc 
gggatcaagc 
gctaatgcca 
gtccctcagc 
ggaaccgaaa 
attgctggtg 
cagagacgca 
agactccaga 
gcgcctgcaa 
gaataaggag 
ggtgcgcacg 
ttcttcctcg 
gaaccacaaa 
gggtcctgcc 
gacggcgacg 
tggggcggta 



ttgcgcgatg 
ttggcggcgg 
tccagcgtgg 
tcaaacttaa 
atgttaacat 
tttttaatgg 
cccaacaaca 
ttgtccatgt 
ggagccacca 
ttggtggggt 
tttctccata 
tcattacgtg 
ccattgccat 
aaacaataat 
tcatagctgt 
tcaagcaaga 
agctgcgatg 
ccaataaaat 
aacatagctt 
gtttccatat 
cggtcgttca 
cctttttcat 
gatccatagc 
gcatcagctt 
gcgtatacag 
gtaafctgttt 
tctcgagcgt 
tcggcaactg 
ttgggagcta 
agcacgccgc 
gccagggtga 
cgcagcgtca 
tttctaaact 
ccggggctca 
atcatcgaag 
cggcgacatg 
cgcttcaggg 
ctggctacgg 
gggcgggtta 
gggggaggca 
gcgatgggac 
tctatttgcg 
tctggagaca 
ttactgttaa 
gaggccaggc 
atctggcctg 
tttttaatgc 
ccgatatctt 
aagttgatgt 
ctttttattt 
ggccacggat 
gggcaggata 
acgcatgccg 
gaatcccggc 
gaccggcttt 
tccttcgcga 
gcggcggcgg 
ggtgtagcca 



ggatggagat 
aaaggtagtc 
aggccgtgtt 
tgctagcccc 
ccttcctgaa 
caaaaaactt 
tggagcgata 
agtccagaga 
ctcgcttgtt 
tgtattttag 
ggttggcatt 
ttgcaaaagt 
tagccttaat 
ttggcaattc 
ctacagcctg 
gttgatagga 
cctgaccagc 
tgtccctgaa 
tagaattttc 
ttacatcttc 
aagaggtagt 
ccggaaccag 
atggtttcat 
cgttccactg 
gtttagcttg 
ctccagacaa 
tttgctcttc 
cccggccgct 
gagcgttgta 
ggatgtcaaa 
accgcgcttt 
aacgctggga 
tgttattcag 
ggtactccga 
gggtagccat 
gacgcataca 
attgcacccc 
gccgcatcga 
caacaacggc 
ggtccagcgt 
gggtggtggg 
tcaccagagt 
ctgtctccac 
tcttattttg 
catctaccac 
tgctgctgtt 
cgctccacag 
gccagtttcc 
cttccattct 
tgatttttcc 
gcaattccgg 
ccgccgcgca 
cccgtcaggc 
accgggaaac 
taaacgatgc 
gccaccctgc 
gttccagtgg 
cgatagccgg 



gggcacgttg 
attaaaggac 
gtgggccatg 
gtcaactcta 
gttccattca 
ttggggcacc 
acggaggcGc 
ccagcgcgcc 
catgtagtcg 
cttgtctggc 
taggttaatt 
ttcatctttt 
agcttgatag 
atcctcagtt 
attccacata 
cagctctgtg 
aagaacaccc 
agcaatgtaa 
atcacctttt 
actgtacaaa 
atttgagaag 
aacggattga 
gggagttgtt 
agattcgcca 
tgtttctgca 
aggagcctgg 
ttcttcctct 
atcttcggtt 
ggcagtgccg 
gtacgtggaa 
gtacgagtac 
ccggtctgtg 
gctgaagtac 
ggcgtcctgg 
cttggaaagc 
tgacacatac 
cagacccacg 
tcgcggaccg 
ggacggccct 
tacaggtgtg 
caggccttgc 
ttcttccctg 
ggccggtgga 
cactgcctgg 
cttttgttgg 
ccaggccttg 
cgagcGccag 
catgaacggg 
acaaaatagt 
acatgcaact 
gcacggcgcc 
tgcgacggtg 
cgtggccggc 
ggaggcggca 
tggggtggta 
gcacggaaat 
tggttcggcg 
gggtaggcgc 



gtggcgttgg 
tggtcgttgg 
gggaagaagg 
agatcgtttc 
tatgtatatg 
tgaatgtgaa 
gcattgcggt 
ccaaggttaa 
taggtgttgg 
aggtacagcg 
tccatggcaa 
gtccatgtag 
gtgtcagtta 
ccatggtttt 
gaaaaatatc 
tttctgtctt 
atgttgccag 
ttgggtctgt 
ccaggtttgt 
accacttttg 
aattgcaagt 
ccaccaaaag 
tttttaagca 
atttgaggtt 
ttgtctgatc 
gcatagacat 
tcttcatctt 
fcgttcccact 
gagtagggct 
gccatatcaa 
gcggtatcct 
gttacgtcgt 
gtctcggtgg 
cccgagatgt 
gggcgcacgg 
gacacgttag 
atgctgttca 
ctggcggcac 
ggcagcacag 
tgctggccca 
tttagtgcct 
tcgggccgcg 
ggctcctcta 
ttggccaggt 
aaattttgct 
ctgccatagt 
ctgaaggcgc 
cgcgagccgt 
tacaggacca 
tgtttttaat 
aatcgccgcg 
cgacgcgcgc 
catgcccctc 
ggtgagggcc 
gcgcgctgtt 
cggggccagc 
tcgggtagtc 
gatggaagga 



12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 



-16- 



wo 02/20814 



PCT/USOl/27682 



tgtagggcat 
cgccgggggg 
gacgcggccg 
gtttgtgtct 
tgaacgtcca 
acctcgggct 
cgcttgggta 
cgcttttctt 
ctaggtgttg 
taggcaaact 
tcatcataca 
gtgccgggtc 
gcggttgcgt 
ttctttttga 
ccatagatct 
gcttcgttgc 
cttgcaatct 
tcgcgcaccc 
gctgcggcgg 
cttcgagcgg 
gccgccgcgc 
cgccgtctct 
tccactgtgg 
tcaatggcgt 
gcgcggtagt 
aacatcttgc 
gacatgtttg 
ttgtaaacgt 
ccgatgctgt 
acggtggtga 
aacacgtggg 
tagaagctct 
tggctcgtgg 
tgctccgagc 
tgagggtcgc 
ttgcttatta 
ttcttctgag 
tcggcctcag 
tcgttcatgt 
ccgctgccac 
aagcttgcct 
tcgtaggtga 
ctcaggcggc 
aaggcttcgt 
ctgaagttgc 
cccactttca 
gagtagttgc 
tccttagtaa 
tcgttcacat 
cggtcgtcaa 
atgaccgtgg 
gacttgttgt 
cggatgctgt 
aggggagcat 
tcgtaggagg 
caagaaaacc 
tttttttttt 
actccctccc 



afctcgggcag 
ctgaaacgcg 
cagcggccgc 
ccatgccctc 
cggtctgcac 
ccagcccagg 
ccatcagctg 
cggacggtgc 
ggttgccctc 
ccccgaggcg 
cgcgcttgta 
gcaaaacacg 
gcagcagttc 
cccgctttag 
ccggcgcgat 
gcgccgccgc 
agttgcgcgg 
agtacacgtt 
ccgctcgtcg 
cccgcatggc 
gttgggcggc 
tcattttagc 
acactggtgg 
catcgacggt 
gcccgcgcac 
ttgggaagcg 
ctcaaaaagt 
aggggcaggt 
tgcgcagcgg 
tggtgggggc 
tcagagaggt 
tggagtgcac 
agcggaaggt 
cgcaggttac 
cgtagttgta 
ggttgtaact 
gcttctcgac 
cgcgcttctc 
cctccaccgg 
tgttgttgcc 
ggtaggcgtc 
tcctaaagcc 
tgtgggtgaa 
ttgtatatac 
gggtgtcaaa 
agtagtgctc 
cctcgggcag 
gcgagcgcga 
ttggcatgtt 
gattgatggt 
ttagaaagtt 
ccacaaggta 
ttctcccccc 
cgaaggggga 
gaggaggacc 
aacgctcggt 
ttttaaaaca 
aaatccagga 



tagtgcgctg 
aaacatccac 
ctgcaccgcg 
tgtggcagtg 
gcccagtccc 
ctccacggtc 
cacggtgggt 
aagcgtgggc 
gtccagcggc 
ctcgttggcc 
ggtgcgggtg 
tcttacgcgt 
cacctcgtcg 
ctttcggggc 
gacctggagc 
cgctggatac 
ggggcgggtg 
gcccctgcga 
cctggacctg 
cgcccgtcgg 
agtgccgggt 
ataacgccgg 
cggcgtgggc 
ggtgcgccca 
gcgcactggg 
caggccccag 
gcggctcgat 
gcggcgtctg 
tagcgtcccg 
tggcgggcgc 
aaactggcgg 
gggcaacagc 
cacggggtct 
gtcaggagtg 
tgcaaggtac 
gcgtttcttg 
ctcgggttgc 
ctccgcccgt 
ctgcattgcc 
gccgcctgcg 
cacatccaac 
ctcctggaag 
gtccaccccg 
cGcaggcatg 
ctttaccccg 
cacgatcgcg 
cgtgaactcc 
caccatcacc 
ggtatgcagg 
ctgtgtgctt 
gctgtggtcg 
cacacgggtg 
ggtaggccgc 
acccagcgcc 
ttcctcatac 
gccatggcct 
ttctccccag 
cgctgctgtc 



gcggtgccgt 
gggtccgttt 
gcatctgcca 
gcaatactag 
ggtgccacct 
attttttcca 
gccaagtcac 
agcacctgct 
aacgccaaca 
tgctcaagca 
gagcgctcac 
cgacctttcc 
tcaagttcat 
ttgtaatcct 
atctcttctt 
atacaacagt 
cgcacgggca 
ccctgagtca 

gggggcacag 

ccggtgcgac 
cggcggcggt 
gctccgcgca 
gtgtagttgc 
gtgcggccgc 
tgttggtcgg 
cctgtgttat 
aggacgcgcg 
gcgtcagtaa 
tgatctgtga 
gccaaaatct 
atgagctggg 
tcggcgccca 
tgcatcatgt 
caaaggaggg 
cagctgcggt 
ctgtcctctg 
gcagcggggg 
gtggcaaagg 
gcggctgccg 
ccatccccgc 
agtgcgggaa 
ggttgccgct 
catcctggca 
acaagaccag 
atgtcgcttt 
ttgttcataa 
acccactcat 
cgcgccttaa 
atggttttca 
gcctcccccg 
ttctggtagt 
gtgtcgaata 
aggtaccgcg 
gccgccactg 
atcgccgcgc 
tggtgagttt 
cctggggcga 
gtctgccgag 



acttcctgga 
gcacctccgt 
ccgccgaggc 
tgctactggt 
gcttgattgg 
agacatcttc 
cagactcgcg 
gcagtgtcac 
tgtccttatg 
ggtcctcgtc 
cgggcgtaaa 
actgtacccg 
catcatcatc 
gctcttcctt 
tgattttgcg 
acgagtctaa 
cgcgcaggcc 
tagcactaat 
tgacaatacc 
gtgcgcggtt 
ggcgacgtgc 
ccacggtctg 
gcgcctcctc 
gfcttgtgcgc 
agcgcttctt 
tgctgggcga 
gcgagactat 
tggtcactcg 
gagcaggaac 
ggttctcggg 
agtagacggc 
ccaccggaaa 
ctggcaacga 
tccatgagcg 
actgggtgaa 
tcaggggttt 
cggcagcttc 
tgtcgccgcg 
cgttggagtt 
cctgttcggt 
tgttaccacc 
tgcggatgcc 
gcaaaatgat 
tgactgggtc 
ccagaaccGC 
ggtctatggt 
atttcagctc 
acttattggt 
ggtcgccgcc 
ggctgtagtc 
tcagggatgc 

ggggtgccaa 

gaggcacaaa 
gcgccgcgct 
gctgcatact 
tttattttgc 

aggtgcgcaa 
tcatcgtcct 



acggcgcggg 
agaggttttg 
aaccggggac 

ggtgggtatc 
ccgcacgcgg 
cagtcgctgg 
ctttaggccg 
gggctttagg 
ccgctttccg 
gccgtacacc 
aactacggtg 
ccgcctgggc 
atctttcttt 
cttcgggggg 
cttggacata 
gtagtttttt 
gctaaccgag 
ggccgcggct 
cgcggccagc 
aagcagggcc 
tacgcgcctc 
aatggccgcg 
caccaccgcg 
gccccagggc 
tgccccgcca 
tataaggatg 
gcccagggcc 
ctggactcct 
gttttcactg 
aaagcgattg 
ctggtcgttg 
gttgctgatc 
ccagtagacc 
gatcccggtc 
ggtgctgtca 
gatcaccggt 
tgccgctgcc 
aatggcatga 
ctcttccgcg 
gtcatctttt 
ctccaggtca 
caacaagttg 
gtctggatgg 
aaaccGcagt 
gttctgcctg 
catggtctcg 
cacctgtttg 
aaacatgaac 
ccagtgcgaa 
attgttttga 
cacatccgtt 
ctcagagtaa 
cggcgggtcc 
caccacgctc 
aaggggaata 
atcatgcttt 
acgggttgcc 
cccacaccag 



15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
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accccgctga 
gtgctcctgc 
gttgggaaat 
gtactcctcc 
caagcttgga 
gctgcttggc 
caactctagc 
gaatccatcg 
ccagtgcggg 
caagtagtcc 
gttgcgcgcg 
gctcacgctc 
cgaggggatc 
cgtgtcctcc 
cacatgcgcc 
ccgggacacg 
gggcgtcagc 
cccctgcaag 
ctggaaaaaa 
tagccgcagt 
ctgtgtactc 
gcgctcacgc 
cacgttgccg 
ctcatcgggc 
gtgcgtgggg 
ctgcggctgc 
gacctggcgc 
caccgccggg 
ccgtcctctg 
gcgcgcgcgg 
cttggggccc 
ctgcccgtgc 
cgccgccggc 
tcgctcaagg 
ttgtctatgg 
agcatggaat 
actatgctct 
tttagcagcg 
tgcgccatga 
tgcatcagtc 
cttttttgaa 
gccgccacgt 
tcgcgcagca 
cgatcccgca 
acgtacgcct 
gctcctagcc 
cgcgggggtt 
gcggtaggag 
tgaggagggg 
tcgtccctgt 
cgttcgccgc 
gaaaafcaacc 
cgcttacgcc 
aggaagccaa 
ggcggtaacc 
gccatgatac 
tccttttgca 
accatggagc 



cggtcgtgcc 
gcatacgtct 
gccggaggca 
tcgcccagca 
aatgggctac 
ctgcggaagc 
agggtctgcg 
ttaccctcgg 
ttcaagatgg 
attaggcgat 
gtcatgtcca 
tgctgcacat 
ttctgccgcc 
fcgccccagcg 
tgacctatgg 
ctgccactgt 
aagctagaca 
ttcttgaaag 
tagtctggcc 
gcgctcacaa 
aggaaggcgt 
tgcgccacgg 
ctgttgtaac 
cggatggccc 
tttgcgcgcg 
tgccggaacg 
cagtcgtccg 
tccgttgcgt 
gctcgtactc 
gtgccaccgc 
agcgcaggtc 
cagccagggc 
tcacgctcac 
taagcacctt 
gaacgtaagg 
agttaatggc 
gcagaatgtt 
catccctgaa 
gcggcttgct 
ctatagccac 
agttaatctc 
gtgcgcgcgc 
accgctcgcg 
tctcctcggg 
cgcgcgtgtc 
gcgccaggcc 
cgtaatcacc 
aaggggaggg 
ggcgcatctg 
ttccggagga 
agtccggccg 
ctccggctac 
gcgcgcggcc 
aaggagcgct 
gcatggatca 
ccttgcgaat 
cggtctagag 
actttttgcc 



tttgacgacg 
tccatctact 
ggttcttttc 
ggcgcgggcg 
tcgcatctga 
tttcctttcg 
gttgcgggga 
gcacctcaaa 
cattggtgaa 
tgataaacgg 
gcgccacgct 
agcgcaagat 
ggttggtcag 
cgcggctgac 
cctcgcggta 
ccgtgaaggg 
cggtcgcgcg 
cctggctcag 
cggactggta 
agttggtgta 
ttagtgcaac 
cctcgcgcac 
gagccacgcg 
tgttttcggc 
ccgggaccac 
cgtcagggtt 
tggagttaag 
cttgcatcat 
atcgtcctcg 
cagcccaggt 
agcgcccgcg 
cctttgcagg 
ggccttgtgg 
caacgccatg 
ggtatggtat 
ggccaccttg 
tatcaaatcg 
tgcctcgttg 
atttgggttt 
ctcctcgcgc 
ctggttcacc 
gggactaatc 
gttcaggcca 
ctcctctccc 
acgcttcagc 
ctcgccctcc 
atctgccgcc 
tgccctgcat 
ccgcagcacc 
atttgcaagc 
gcccgagact 
agggagcgag 
agtggccaaa 
cccccgttgt 
cggcggacgg 
ttatccacca 
cgtcaacgac 
gctgcgcaac 



ggtgggcggg 
catcttgtcc 
gcgctgcggc 
ggtggtgcga 
ccgcggggcc 
cagcgccgcc 
aaacacgctg 
tcccccggtg 
atactcgggg 
ccggtttgag 
gg'gcgttacc 
gcgctcctcc 
caggtagttc 
acttgtaatc 
cagtgtcagc 
cgctattagc 
gtcgcctgtg 
gtttacggtc 
cacctcactt 
gtcctcctgt 
catggagccc 
atcccccacc 
ctgaagcagc 
cagcgcgttt 
cgcttccaga 
acgcgcagtc 
gccggacggc 
ctgatcagaa 
tcatattcct 
ccggccccag 
tcaaagtagg 
ctgtgcatca 
atgcgctcgt 
cgcatgtaga 
atcttgcggg 
fccagccaggc 
agcagccagc 
tccctgctgt 
tgctccagcg 
gccacaagcg 
gtctgctcgt 
ccggtccgcg 
tgccgcaact 
tcgcggtcgc 
tgcacccttg 
tccaagtcca 
gcgtcagccg 
gtctgccgct 
ggatgcatct 

ggggtcttgc 

cgaaccgggg 
ccacttaatg 
aaagctagcg 
ctgacgtcgc 
ccggatccgg 
gaccacggaa 
tgcgcacgcc 
atctggaacc 



cgcgggccgg 
actaggctct 
tgcagcagcg 
gtgctggtaa 
gcagcgccta 
tctgcctgct 
tcgtctatgt 
tagaaaccag 
ttcacggcgg 
gcatacatgc 
ccgtcgcgca 
tcgctgttta 
agggttgcct 
tcctggaaag 
aagtgaccta 
agcagcaaca 
ggagcccgca 
tgcaggcctt 
tgcggtgtct 
ccccgcggca 
aggttgccct 
agccggtcca 
gcgtcgtaga 
acgatcgcca 
attgcggaga 
agcgacatga 
tggctctgca 
acatcaccgc 
ccacgccgcc 
ctgcctccag 
actcggcctc 
gctcgcggtc 
tgcgataaac 
acccctcgat 
cgtaaaactt 
tcaagctgcg 
ggccctcggg 
gctgcactat 
cgcttacaaa 
tgcgcacgtg 
acgcggttac 
cgtcgggctc 
cgcgccctgc 
gaaacaggtt 
ggtgtcgctc 
ggtagtgccg 
cggatgttgc 
gctcttgctc 
gggaaaagca 
atgacgggga 
gtcctgcgac 
ctttcgcttt 
cagcagccgc 
acacctgggt 
ggttcgaacc 
gagtgcccgc 
tcaccggcca 
gcgtccgcga 



gcacatccct 
ctatcccgtt 
agttgtttag 
aagaccctat 
gatcggacaa 
cgcgctgttg 
cgtcccagag 
ggggcggtag 
ccgcgcgatg 
ccggttccat 
tcaggttaag 
aactgtgcaa 
ccaggctgcc 
tatgctcgtc 
ggtatgtgtc 
ggcgcgagtt 
ccccccacag 
gtctactggt 
cagtcaccat 
cgfctggcggg 
gctgctgcgc 
ggttggtctg 
ccaggccggc 
gcaccttctc 
gccggttggc 
tgcggtccat 
gcgccgcccg 
ttagtactcg 
gacgttgcca 
ggcgcgtcgg 
tctatcgccg 
gctgagctcg 
gcccaggtcg 
ctttacctcc 
gcccagactg 
ctcctgca.cc 
ctctactatg 
aaggaacagc 
gtcccacaga 
gttgttaaag 
caggtcggcg 
aaagtcctcc 
gtggaacttt 
ctgccgcggc 
aggagagggc 
ggcccggcgc 
ccctcctgac 
ttgccgctgc 
aaaaaggggc 
ggcaaacccc 
tcaacccttg 
ccagcctaac 
cgcgcctgga 
tcgacacgcg 
ccggtcgtcc 
ttacaggctc 
gagcgtcccg 
ctttccgcgc 



19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
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gcctccacca 
cttatgttgg 
cagccgccgc 
gtctttgact 
cagaccgtta 
caccgctttg 
ttagccgagc 
acacgcatgg 
gatgcaagag 
caagactact 
ctgcgcattc 
aagaccgcct 
cacccgctgc 
tttctcgaga 
gtacctacac 
cccccgccaa 
aacggccgcg 
gtcgaccgcc 
gaagaagaag 
gtagcctttg 
gagttaaccg 
gccatggagc 
atgtacttct 
cgaaactacg 
gcccgcgatg 
gccttctcgc 
ggacgcggag 
gacaactcag 
gattctgtcg 
cgccagattc 
caccagctcc 
gagccccccc 
acacccccgc 
tcttgcaagt 
tagcccgcg;c 
cgccgcggga 
gaccgccgcg 
taactcccct 
caa.a.cccgcc 
tcaacctcgt 
aaagccagca 
actcctccaa 
agcgtctctt 
agctcgtgcc 
cgcgagacct 
actgcatcac 
tgcaaatgct 
cagactgggc 
ttaaaaaatt 
acaacatcaa 
aggtgccggg 
tcaacgatgt 
tctgggagca 
ttagggacac 
tacccgtaga 
cttaccgttc 
ttgtcctcaa 
aaaccctgga 



ccgccgccgg 
aagacctcgc 
cgcacttttt 
caagggctta 
actggtccgt 
tggacatgga 
gcgttgtcgc 
gaggaagagg 
atgcaggaca 
acaaagacct 
agcaggccgg 
actttaatta 
cgcccgccac 
ggttttccga 
aacaattgtt 
cccataaccg 
ccgtcaccga 
tcccggtgcg 
aagaagggga 
agcgcgaggt 
tgtcggcgcg 
gccttgaggc 
tcgtggcaga 
ccgtcttcgc 
ccgaaggggg 
agctcatggc 
atctccagga 
gagacgtgca 
aactctcttt 
aggagatcaa 
tgcccgcgcg 
tacctccggg 
ggcccaccgc 
catcgacgcc 
cctgacccgg 
gctccagacc 
cgcggacatc 
cgagcagcca 
a.ggcgagggg 
gcaggatgta 
cgagtgttcg 
ctggtggcgg 
tgtcacctac 
cttcatgctg 
agccgtggac 
cccagaaaaa 
aatggcccgt 
Gctgtcagaa 
gccctccatc 
cggcttcgac 
acccttccgc 
caccttcgcc 
erggcggatgc 
ctttgcgctc 
aaagggatgc 
ggaggccgac 
ccgcgagctg 
ctactgcgcc 



catcacctgg 
ccGcggagcc 
ggtgggatat 
ctcgcgtctc 
tatggccaac 
tgacttccag 
cgacctggcc 
gcgccaccta 
agaggaagga 
gcgccgatgt 
acccaaggac 
catcatcagc 
ggtgctcagc 
tccggtcgat 
gagatgcatc 
ggacatgacg 
gaccatgcgc 
ccgtcgtcgc 
ggcccttatg 
gcgcgacact 
caactcccag 
cttgggggat 
acacaccgcc 
ccggcacgtg 
cgtggtctac 
ccgcatctcc 
ggaagagatc 
ggagattttg 
caggttcaag 
ccgccgcgtc 
cggcgccgac 
ggcccgcccg 
ccgccgcgcg 
gccaccaacc 
ctatgcgagg 
atggacagct 
tggactttgg 
gacggtcaag 
ctcaaattcc 
cagcccgtgc 
gcccgtcgca 
gagatccagt 
gatgtagaga 
gttatgaagt 
cttggatggg 
atggccatag 
gacctgtgga 
cacgggctca 
a-agggcaccc 
gagatcgtgc 
atcacacgca 
ctgccaaacc 
gacgacactg 
acccacacct 
tgcgcctacG 
gggtttccga 
tggaaaaaaa 
ctagacgtgc 



atgtccaggt 
ccggccaccc 
cagtacctgg 
aggtacaccg 
tgcacttaca 
tctaccctca 
ctgcttcagc 
cggccaaact 
gaagaagaag 
caaaacgaag 
atggtgcttc 
agcacctccg 
ctaccttgcg 
gcggactcgc 
gttagcgccg 
ggcggcgtct 
cgtcgccgcg 
cgccgtgtcc 
gaagaggaga 
gtcgccgagc 
tttttcaact 
atcaacgaat 
accaccctca 
gagctcaatc 
agccgcgtct 
aacgacctcg 
gagcagttca 
cgccaggccg 
ctcaccgggc 
gtcgcgttcg 
gtgcccctgc 
cgtcaccgct 
gtaccgtagt 
agcccctgga 
taaacctgca 
cccatctgcg 
gctcgcgcgg 
gacaagcagc 
cactctgctt 
accgctgcca 
gggacttcta 
tcttcccgat 
cctatacttg 
tcggcggaga 
accgctggga 
gtcgccagtt 
gctcattcgt 
gctcccctga 
cgcgcttctt 
tcgccgccca 
actttatgcc 
cgcgttccaa 
acttcaaata 
cgctccggaa 
aggccgtcaa 
tccaagagta 
agggacagga 
aggtcaccgc 



acatctacgg 
tacgctggcc 
tgcggacttg 
agctctcgca 
ccatcaacac 
cgcaggtgca 
cgatgagggg 
ccgccgccgc 
tgccggtaga 
cctggggcat 
tgtcgaccat 
ccagaaacaa 
actgtgactg 
tcaggtccct 
tatccctgcc 
tccaactgcg 
d'ggagatgat 
cccctccccc 
ttgaagaaga 
tcatccgtct 
tcgccgtgga 
ccacgttgcg 
actacctctt 
tcgcgcaggt 
ggaacgaggg 
ccgccaccgt 
tggccgaaat 
ccgtcaacga 
ccgtcgtctt 
ccagcaacct 
cccctctccc 
tttagatgca 
cgcgccgcgg 
aatcaggtat 
ggagctcccg 
cgatgttgtc 
cgtggtggtc 
cgaagtagaa 
ccttgtgcgc 
gtactgcgca 
ctttcaccac 
cggctcgcat 
gatgggggcG 
tgagcctctg 
acaagacccg 
taggaccttt 
cgcttccaac 
ggagctcacc 
ggaaGtttac 
ggtaattaaG 
tcgGgcggga 
aaagcgcacg 
ccagtacctc 
ggccgcgcag 
ccagttctac 
ctggaaagac 
taagtatgac 
GgagGtggtc 



atatcatcgc 
cctctaccgc 
caacgactac 
gccgggtcac 
gggcgcatac 
gcaggccata 
cttcggggtc 
cgtagGgata 
aaggctcatg 
ggccgaGcgc 
ccgccgtctG 
ccccgacGgc 
gttagacgcG 
cggtggcgga 
gcacggcagc 
cccccgcgag 
cgagcgcttt 
accgccgcca 
agaggcccct 
tctggaggag 
cttctacgag 
acgctgggtt 
tcagcgcctg 
ggtGatgcgc 
aggcctcaac 
ggagcgagcc 
cgcctatcaa 
caccgaaatt 
cacgcagagg 
ccgcgcgcag 
ggcgggtccc 
tcatccaagg 
ggatgcggGc 
cacctggacc 
cctgacctgt 
atcaagctcc 
cgatccacca 
gaccaccagc 
ggfccgtcagg 
cgtttttaca 
a.tca.a.ca.gcc 
CGtGgcaGcg 
tttgggaagc 
gtgaccgccg 
cttaccttct 
cgcgaccacc 
cctGatcttg 
tacgaggaac 
atcgtgggcG 
aaGcgttGcg 
aagatacttt 
gactttttgc 
aaagtcatgg 
gcatacgcgc 
atgctaggct 
cgcgaagagt 
atcatcaagg 
aacaagctgc 



22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
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gcgactccta cgcctccttc gtgcgtgacg 
tcttccagcg tccaaccata tcatccaact 
gagcagagca gcccgcccgt agcaacctcg 
tatacgatta cgtgcgcgcc agcatccgcg 
tactcagaga gcccctctac gtttacgaca 
accccatgcc atggggtccc ccactcaacc 
ggcagcaggc gctagacttg caaggatgca 
ccggggtctt taccgtggac gcagaccccc 
cattctgttc gcgcaagggc ggccgcctct 
tagccaccag cgttgacctt gtcaccctgc 
ccgacgagcg caccaccgtc tttcccgaat 
taaacatcgc ggccaaggag cgcgccgatc 
ccaagttgct gtccaacgcc ctctacgggt 
ttgtcttttc tgaccagatg gacgcggcca 
atatcaaatc ctcctcgttt ttggaaactg 
ttgagaggga gtactcaccc caacagctgg 
aggacgaacg cgcccccacc cccttttata 
cctacaccta taaaccaatc accttccttg 
ccctggagcg agtggacccc ctagtggaca 
tcgtgctggc ctggacgcga gccttcgtct 
accgcggaac accgctcgag gacaggcctc 
ttttcgtcac cgagcgtgga caccggctca 
agcatggggg aaacctggtt tttgaccccg 
gcgagaccgt ctgcggggcc tgcggcgcgg 
cgcccaagct ctacgccctt aaaagtctgc 
gcaagctgcg cgccaagggc cacgccgcgg 
gctacctggc cgacgcgcag ggcgaagacc 
tcaagcgcac cctggccagc gcgcagcccg 
cgctgacgag gaccctgcgc ccgtggaaag 
gactactgcc gtacagcgaa agccgcccca 
agatgccgta gagcacgtga ccgagctgtg 
caaaagcatg cctacggcgg acggcctcaa 
actgctatcg ctgggcggcg agcgccttct 
cagggacatg cttaacgaag tggcccccct 
taactaccag ttgcagccgg taataggtgt 
gcagctgctc aggaacctgc tttcttccca 
cttcatcgcc ccgcaggtag acatgatccc 
aatctgtgag ggtaactacg cccctgggcc 
cctccgcccg cgcfcttgtaa aaatggccta 
cgttagtgat cccagaaata tcttcgccca 
tatggacgaa tgcatggaaa atctcggagg 
atttccttct aagctacatg acaaatttcc 
tctgcacaac atgaatcccc ggagggatat 
gtccaagatg catctcatat ccccacgtat 
cacttacacc aagggcctgc ccctggcaat 
ccacgcccag cgctcctgct acgactggat 
tctgcagtgg tgctacctcc accccagaga 
gagtcacctt taccacgtcc tggaaaaaat 
gtcccgggcc taccgcgcgc gcaaaacccc 
caaaatccaa acagagtctg gtttttattt 
gccttcaggg cagaaacctg ctggcgcaga 
tcccgggtca aagaatccaa ttgtgccaaa 
gaacgggaag ctgcactgct tgcaagcggg 
cgggcggtgg ctgcagcggc tgaagcggcg 
gacacggtct cgtaggtcaa ggtagtagag 
ctggagccca tcacattctg acgcaccccg 
gagctcacaa tgcttccatc aaacgagttg 
atacaaaact acatgagacc cccaccttat 



cggtaggtct cacagacgcc agcttcaacg 26100 
cacatgccat cttcaggcag atagtcttcc 26160 
gtcGcgacct cctcgctccc tcgcacgaac 26220 
gtggaagatg ctaccctaca tatcttggaa 26280 
tttgcggcat gtacgcctcc gcgctcaccc 2 6340 
catacgagcg cgcgcttgcc gcccgcgcat 26400 
agatagacta cttcgacgcg cgcctgctgc 26460 
cggacgagac gcagctagac ccactaccgc 2 652 0 
gctggaccaa cgagcgccta cgcggagagg 2 6580 
acaaccgcgg ttggcgcgtg cacctggtgc 26640 
ggcggtgcgt tgcgcgcgaa tacgtgcagc 26700 
gcgacaaaaa ccaaaccctg cgctccatcg 26760 
cgtttgccac caagcttgac aacaaaaaga 2 6820 
ccctcaaagg catcaccgcg ggccaggtga 26880 
acaatcttag cgcagaagtc atgcccgctt 26940 
ccctcgcaga cagcgatgcg gaagagagtg 27000 
gccccccttc aggaacaccc ggtcacgtgg 27060 
atgccgaaga gggcgacatg tgtcttcaca 27120 
acgaccgcta cccctcccac ttagcctcct 27180 
cagagtggtc cgagtttcta tacgaggagg 27240 
tcaagtctgt atacggggac acggacagcc 27300 
tggaaaccag aggtaagaaa cgcatcaaaa 273 6 0 
aacggccaga gctcacctgg ctcgtggaat 27420 
atgcctactc cccggaatcg gtatttctcg 27480 
actgcccctc gtgcggcgcc tcctccaagg 27540 
cLggggctgga. ctatgacacc atggtcaaat 27600 
ggcagcgctt cagcaccagc aggaccagcc 27660 
gagcgcaccc cttcaccgtg acccagacta 27720 
acatgaccct ggcccgtctg gacgagcacc 27780 
acccgcgaaa cgaggagata tgctggatcg 2 7 840 
ggaccgcctg gaactgcttg gtcaaacgct 27900 
accgttgaaa aactttgctt ccttgcaaga 27960 
ggcgcatttg gtcagggaaa acatgcaagt 2 8020 
gctcagggat gacggcagct gcagctctct 2 8080 
gatttacggg cccaccggct gcggtaagtc 28140 
gctgatctcc cctaccccgg aaacggtttt 2 8200 
cccatctgaa ctcaaagcgt gggaaatgca 282 60 
ggatggaacc attataccgc agtctggcac 2 8320 
tgacgatctc atcctggaac acaactatga 28380 
ggccgccgcc cgtgggccca ttgccatcat 28440 
tcacaagggc gtctccaagt tcttccacgc 28500 
caagtgcacc ggatacactg tgctggtggt 2 8560 
ggctgggaac atagccaacc taaaaataca 28620 
gcacccatcc cagcttaacc gctttgtaaa 2 8680 
cagcttgcta ctgaaagaca tttttaggca 2 8740 
catctacaac accaccccgc agcatgaagc 28800 
cgggcttatg cccatgtatc tgaacatcca 28860 
acacaggacc ctcaacgacc gagaccgctg 28920 
taaataaaga cagcaagaca cttgcttgat 28980 
atgttttaaa ccgcattggg aggggaggaa 29040 
tccaacagct gctgagaaac gacattaagt 2 9100 
agagccgtca acttgtcatc gcgggcggat 29160 
ctcaggaaag caaagtcagt cacaatcccg 29220 
gcggaggctg cagtctccaa cggcgttcca 29280 
tttgcgggca ggacggggcg accatcaatg 29340 
gcccatgggg gcatgcgcgt tgtcaaatat 29400 
gcgctcatgg cggcggctgc tgcaaaacag 29460 
atattctttc ccacccttaa gccccgccca 29520 
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tcgatggcaa 
tcagtgtcgc 
taatacgata 
tgtgatatgt 
tacggggcgg 
tccgttcttc 
aacgacaggt 
ccgtggaatg 
ccaggaaccg 
agcatcacaa 
accaggcgtt 
ccggatacct 
gtaggtatct 
ccgttcagcc 
gacacgactt 
taggcggtgc 
tatttggtat 
gatccggcaa 
cgcgcagaaa 
agtggaacga 
cctagatcct 
cttggtctga 
ttcgttcatc 
taccatccgg 
tatcagcaat 
ccgcctccat 
atagttttcg 
gtatggcttc 
tgtgcaaaaa 
cagtgttatc 
taagatgctt 
catagcagaa 
aggatcttac 
tctgcatctt 
gcaaaaaagg 
tattattgaa 



acagctatta 
tgatttgtat 
cctgcgtcat 
agatgataat 
cgacctcgcg 
ttcgtcataa 
gctgaaagcg 
aacaatggaa 
taaaaaggcc 
aaatcaacgc 
tccccctgga 
gtccgccttt 
cagttcggtg 
cgaccgctgc 
atcgccactg 
tacagagttc 
ctgcgctctg 
acaaaccacc 
aaaaggatct 
aaactcacgt 
tttaaattaa 
cagttaccaa 
catagttgcc 
Gcccagtgct 
aaaccagcca 
Gcagtctatt 
caacgttgtt 
attcagctcc 
agcggttagc 
actcatggtt 
ttctgtgact 
ctttaaaagt 
cgctgttgag 
ttactttcac 
gaataagggc 
gcatttatca 



tgggtattat 
tgtctgaagt 
aattgattat 
cattatcact 
ggttttcgct 
cttaafcgttt 
aggctttttg 
gttaacggat 
gcgttgctgg 
tcaagtcaga 
agctccctcg 
ctcccttcgg 
taggfccgttc 
gccttatccg 
gcagcagcca 
ttgaagtggt 
ccaaagccag 
gctggtagcg 
caagaagatc 
taagggattt 
aaatgaagtt 
tgcttaatca 
tgactccccg 
gcaatgatac 
gccggaagtg 
agttgttgcc 
gccattgcta 
ggttcccaac 
tccttcggtc 
atggcagcac 
ggtgagtatt 
gctcatcatt 
atccagttcg 
cagcgtttct 
gacacggaaa 
gggttattgt 



gggtgctagc 
tgtttttacg 
ttgacgtggt 
ttacgggtcc 
atttatgaaa 
ttatttaaaa 
gcctctgtcg 
ccaggccgcg 
cgtttttcca 
ggtggcgaaa 
tgcgctctcc 
gaagcgtggc 
gctccaagct 
gtaactatcg 
ctggtaacag 
ggcctaacta 
ttaccttcgg 
gtggtttttt 
ctttgatctt 
tggtcatcag 
ttaaatcaat 
gtgaggcacc 
tagtgtagat 
cgcgtgaccc 
ccgagcgcag 
gggaagctag 
caggcatcgt 
gatcaaggcg 
ctccgatagt 
tgcataattc 
caaccaagaa 
gggaaacgtt 
atgtaaccca 
gggtgagcaa 
tgttgaatac 
ctcatcagcg 



gacatgaggt 
ttaagttgat 
ttgatggcct 
tttccggtga 
attttccggt 
taccctctga 
tttcctttct 
agcaaaaggc 
taggctccgc 
cccgacagga 
tgttccgacc 
gctttctcat 
gggctgtgtg 
tcttgagtcc 
gattagcaga 
cggctacact 
aaaaagagtt 
tgtttgcaag 
ttctacgggg 
attatcaaaa 
ctaaagtata 
tatctcagcg 
aactacgata 
acgctcaccg 
aagtggtcct 
agtaagtagt 
ggtgtcacgc 
agttacatga 
tgtcagaagt 
tcttactgtc 
tacgggataa 
cttcggggcg 
ctcgcgcacc 
aaacaggaag 
tcatactttt 
gatacatatt 



tgccccgtat 
gcagatcaat 
ccacgcacgt 
tccgacaggt 
ttaaggcgtt 
aaagaaagga 
ctgtttttgt 
cagcaaaagg 
ccccctgacg 
ctataaagat 
ctgccgctta 
agctcacgct 
cacgaacccc 
aacccggtaa 
gcgaggtatg 
agaagaacag 
ggtagctctt 
cagcagatta 
tctgacgctc 
aggatcttca 
tatgagtaaa 
atctgtctat 
cgggagggct 
gctcctgatt 
gcaactttat 
tcgccagtta 
tcgtcgtttg 
tcccccatgt 
aagttggccg 
atgccatccg 
taccgcgcca 
aaaactctca 
caagtgatct 
gcaaaatgcc 
cctttttcaa 
tg 



29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
30120 
30180 
30240 
30300 
30360 
30420 
30480 
30540 
30600 
30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31672 



<210> 4 
<211> 30365 
<212> DNA ' 

<213> Artificial Sequence 
<220> 

<223> derived from Adenovirus 



<400> 4 

gaatcggcca 

cttattttgg 

gcgtgggaac 

ttgtagtttt 

gaagttgtgg 

gttttttgtg 

ttggcccttt 

aggttttttt 

ttctggagcg 

agttatggcg 



gcgcgaatta 
attgaagcca 
ggggcgggtg 
tttaaaatgg 
gttttttggc 
gactttaacc 
ttacactgtg 
actggtaagg 
ggagggtgct 
cgccagatct 



actataacgg 
atatgataat 
acgtaggttt 
gaagttacgt 
tttcgtttct 
gttacgtcat 
actgattgag 
ctgactgtta 
attttgccta 
gtttgtcacg 



tcctaaggta 
gagggggtgg 
tagggcggag 
atcgtgggaa 
gggcgtaggt 
tttttagtcc 
ctggtgccgt 
tggctgccgc 
ggataacttc 
cccgcacctg 



gcgtcatcat 
agtttgtgac 
taacttgcat 
aacggaagtg 
tcgcgtgcgg 
tatatatact 
gtcgagtggt 
tgtggaagcg 
gtataatgta 
gttttgcttc 



cataatatac 60 
gtggcgcggg 120 
gtattgggaa 180 
aagatttgag 240 
ttttctgggt 300 
cgctctgtac 3 60 
gttttttaat 420 
ctgtatgttg 480 
tgctatacga 540 
aggaaatatg 600 
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actacgtccg gcgttccatt tggcatgaca 
cgcactccgt acagtaggga tcgcctacct 
ctggaggatc atccgctgct gcccgaatgt 
gtgcgaggtc ttccctgcag tgtgggattt 
gatatggttc tgacgcggga ggagcttgta 
tgttgtgcca acattgatat catgacgagc 
ctccactgtc attgttccag tcccggttcc 
gccagctggt ttaggatggt ggtggatggc 
cgggaggtgg tgaattacaa catgccaaaa 
aggggtcgcc acttaatcta cctgcgcttg 
cccgccatga gctttggata cagcgccttg 
ctgtgctgca gttactgtgc tgatttaagt 
acaaggcgtc tcatgctgcg ggcggtgcga 
tattcctgca ggacggagcg gcggcggcag 
cgccctatcc tgatgcacga ttatgactct 
ccgcccgttg agcaaccgca agttggacag 
atgaacttaa gcgagctgcc cggggagttt 
caggaaaccg tgtggaatat aacacctaag 
tttaaggcca gccggggaga aaggactgtg 
ttgaatacta gggttctgtg agtttgatta 
ggtggggcta tactactgaa tgaaaaatga 
acgttgaaac ataacatgca acaggttcac 
ggtgtaagag ttggtagcaa aagfcttcagt 
aaaagacata gagtaagtgc ttacctcgct 
tgtaatggta agtatcatag gtttagtttt 
gttattttta gcagtttgac tttgggtttt 
aactgcattt gtgtatggat ttgcattagt 
ttttttaagt gaggagttct ccattagaac 
aacacttgca acggtgcctg tcatggatga 
agtagctagt acttgactcc cacattttgt 
tgaatgaatt ctgcagttag gagatgggtc 
atttttgttt cctattgtaa tggcccctga 
agtaatcatg gcaccgtttt cattgtaatc 
gttgatatct ggagactcag atgtgtttgt 
agctatggca gtattatcaa agtttagtcc 
tttagtattg tttgatgcat taaaaaggta 
atgagatgca ttaatataca ggggtccctg 
atcaaatggg taatccacafc ctagaattaa 
cgttttaatt tccatgttgt ttgatgaatc 
aagggagttt tgttcaacgg tgacacctgg 
ttgtgctact tgcaaaggac cgcttatttt 
aggatcttcc atgttaatgc ccaagctacc 
tacagtaagg gtgtcgctgt cactgccaga 
tccatctgac actgtaatgg gccctttagt 
cagtggggct tgtgactgta cgctaagagc 
tgccactgtt agggcgcctg aggtaattgt 
tgactttgtt tttttaagtg gctgagtaac 
ggccttgtct agggtaagac cgctgcccat 
caaaggttcg gagacgcgta gagagagaac 
tgaaacaaat ggaggggtaa gaaagggcac 
atacacgggg ttgaaggtgt cttcagacgg 
gatagtgggt gcggagggac aagaacatga 
aaagtttgca gctaaaaggc ggctgagata 
gctgaataag ctggacaaag atttgctgac 
agccgtccgc cgagtctttc accgcgtcaa 
cgggaggtcc agaaaagggg ttgaagtaaa 
gagttccaat gcctccggag cgcggctccg 
gacggggcgt aaatgaagag cggccagcgc 



ctacgaccaa cacgatctcg gttgtctcgg 660 
ccttttgaga cagagacccg cgctaccata 72 0 
aacactttga caatgcacaa cgfcgagttac 780" 
acgctgattc aggaatgggt tgttccctgg 840 
atcctgagga agtgtatgca cgtgtgcctg 9 00 
atgatgatcc atggttacga gtcctgggct 960 
ctgcagtgca tagccggcgg gcaggttttg 1020 
gccatgttta atcagaggtt tatatggtac 1080 
gaggtaatgt ttatgtccag cgtgtttatg 1140 
tggtatgatg gccacgtggg ttctgtggtc 1200 
cactgtggga ttttgaacaa tattgtggtg 1260 
gagatcaggg tgcgctgctg tgcccggagg 132 0 
atcatcgctg aggagaccac tgccatgttg 13 80 
cagtttattc gcgcgctgct gcagcaccac 1440 
acccccatgt aggcgtggac ttccccttcg 1500 
cagcctgtgg ctcagcagct ggacagcgac 1560 
attaatatca ctgatgagcg tttggctcga 1620 
aatatgtctg ttacccatga tatgatgctt 1680 
tactctgtgt gttgggaggg aggtggcagg 1740 
aggtacggtg atcaatataa gctatgtggt 1800 
cttgaaattt tctgcaattg aaaaataaac 1860 
gattctttat tcctgggcaa tgtaggagaa 1920 
ggtgtatttt ccactttccc aggaccatgt 1980 
agtttctgtg gattcactag tgccattaag 2 040 
atcaccatgc aagtaaactt gacfcgacaat 2100 
tggataggct agaaggttag gcataaatcc 2160 
tgagttccca tttctaaagt tccagtaatg 2220 
accgttttgg tcaaatctaa ggaatatact 2280 
aagatctcca gatacagcca aagcagctac 2340 
aagaaccaaa gtaaatttgc agtcattatc 2400 
tggggttgtc cacagggtaa gtttgtcatc 2460 
gttgtcaaag cttaaacccg ctccaagttt 2520 
aatgccagag ccaattttag tttttattgg 2580 
atcaaactcc agaccctttc ctgcatttat 2640 
actggatttt tttatgctaa cttccagttt 2700 
taggcctctg ttatagttta tgtccaagtt 2760 
ccccagttta agacgtagtt ttgtttgagc 2820 
caagttgtta tttatacgca tgccaccgcc 2880 
ataaccaata gctcctgcaa ctttggttct 2940 
tccagtaact actgttagtg tatcggagtt 3 000 
aattcctatt tttccattat ttacataaat 3060 
cgtggcagta gttagcgggg gtgatgcagt 3120 
gaggggggct gatgtttgca gggctagctt 3180 
agcaatgctt agtttggagt cttgcacggt 3240 
gccgctagta actatcagag gagcggtggt 3300 
aagtggtgcg gaggtgtcca aacttatgtt 33 60 
agtggttaca ttttgggagg tgaggtttcc 3420 
tttaagcgca agcatgccgt gggaggtgtc 3 480 
tccaggggga ctttcttgga aaccattggg 3 540 
agttggaggc ccggtttctg tgtcatatgg 3 60 0 
tctggcgcgt ttcatctgca acaatatgaa 3 660 
ggaatttgac atcccattta aactttggag 372 0 
ccagagttgg gaggaaggaa aggaggtgat 3780 
tgattttaag taagtaattt attcagtcgt 3840 
agttgggaat aaactggtcc gggtagtggc 3900 
ccgaaggcac gaactcctca ataaattgta 3960 
aggacgaggt ctgcagagtt aggatcgcct 402 0 
cgccgatctg aaatgtcccg tccggacgga 4080 
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gaccaagaga 
ggtgagttat 
cccctgagtt 
gagcgggact 
gaggtgtggt 
cgttgacccg 
cttgggcagc 
gaatttcctt 
gatgacgtag 
gcgcgcagta 
cgcttttgtg 
gctcttgttc 
ttctaagctc 
ctgctgccgc 
gcttctgtgt 
tagagatgac 
agaaagcggc 
ttcttgtgcc 
ggcggctgct 
cgaacaggca 
gccaatttct 
acctcttcgg 
tcgtccaaaa 
ggcggcagct 
atagggctgc 
aaccccccgt 
gccaaggatg 
ttggtcttcg 
gtaagccgac 
aggcgaggga 
attgcagacc 
gccgtcaccg 
cagaaggtag 
gtcgctagga 
cctaaagttt 
aagcaggcgt 
ggccgtccat 
ctcctccagg 
gtaaacgcag 
gttttgcccg 
cacgttggaa 
ttcctctagc 
cacggtaaca 
gcgcgtctca 
ttgttcttgc 
catgactgtg 
ctcgatgctg 
tccacagtga 
gtcgcgtcca 
gacagcgccc 
taggggtatc 
aaatacgggg 
ggggggtacg 

gaggggcaca 
atgcttcaac 
cccgacttgt 
tactgagcga 
ctcctcctca 



ggagctcacc 
accctgcccg 
agtcatctga 
ttcctggtac 
ggtaatagcc 
ggatatcatg 
tccagccgca 
gctcataatg 
ttttcgcgct 
tttgctgaag 
atacaggcag 
ttggcccctg 
gcgggtcgat 
cgctgtggat 
gaccgctgct 
ggtagtaatg 
gggcgaagga 
cgcgccacga 
tagacttacc 
gtgccggcgg 
ggggcgccgg 
cctcggaagc 
cctcctctgc 
gcagcttctt 
ggcggcgggg 
ccctttcgta 
tggccctggg 
tagaacctaa 
gtccacagcc 
ccctgcagct 
agggagcggt 
ctcacgtctt 
cagtgacccc 
agcgcacagc 
tgcaacatgc 
tcggggaaga 
aggtccttca 
cattgctgcc 
tcgcggacgt 
aggcggtttt 
attttgcagg 
ttgcgctgca 
agcactgcgg 
agccagcgcg 
aagtttgcat 
ctcataacct 
cgtttcagca 
ctttcatttt 
agaccctcaa 
tgccgcaagg 
ttgcagtttt 
tagaagttga 
cgcggtgaga 
tcgctgcgct 
agcacgtcgt 
tcctcgtttg 
tcctcgtcgt 
agcgggggtg 



gactcgtcgt 
ggcgaccgca 
acttcggcGt 
accagggcag 
gcctgttcga 
tggggtcccg 
agtcccattt 
gcgctgacga 
taaatttgag 
agagcctccg 
ctgcgggtga 
ctttgttgaa 
acgggttcgt 
ttcttgggct 
gttgctgccg 
caggatgtta 
gatgttgccc 
gcggtagcct 
ggccctggtt 
cgcctgagga 
cgaggggaat 
ttcgtctagg 
ctgactgtcc 
tttgggtgcc 
ggattgggtt 
gcagaaactc 
taatgacgca 
tctcgtgggc 
ccggagtgag 
caaaggtacc 
gcggggtgca 
ccatgatgtc 
aaagcggcgg 
aggtggcggg 
tttgactggt 
taatgtccgc 
agttttgctt 
acacgcccat 
agtcgcggcg 
cgtgcaaaat 
cctggcgcac 
tctccgggtc 
ccatcattag 
ccagctgctc 
ccctctccag 
tggggggtag 
cggctaggcg 
cgctgttttc 
agatttttgg 
ccagctgctt 
ggaaaaagat 
ggcgcgggtt 
acaggtggcg 
cttgcaacgc 
ctcccacatc 
cctctgcgtc 
cttcgcttac 
cctcgacggg 



tgagctgaat 
ccctgtgacg 
gggcgtctct 
cgggccaact 
ggagaattcg 
cgctcatgta 
gtggctggta 
caggtgctgg 
aaagggcgcg 
cgtcttccag 
gggagcgcag 
atatagcata 
tgggcgccag 
ttgtcagagt 
ctgccgctgc 
cgggggaagg 
ccacagtctt 
tggcgctgtt 
ccagtggtgt 
gcggaggttg 
gcgaccgagg 
ctgtcccagt 
cagtattcct 
atcctgggaa 
gagctcctcg 
ttggcgggct 
ggcggtaagc 
gtggtagtcc 
tttcaacccc 
gataatttga 
taggttgcag 
ggagtggtag 
agggcattca 
cagaattcct 
gaagtctggc 
caggtgcgcg 
tagcagcttc 
ggccgtttgc 
cgcctcgccc 
tccaaggtag 
gtagccctgg 
agcaaagaac 
cttgcgtcgc 
atcgccaact 
gggtcgtgca 
gttaagtgcc 
cgcgttgtca 
ttgttgcaga 
cacttcgtcg 
gtccgctcgg 
gtgataggtg 
gggctcgcat 
ttcgtaggca 
gtcgcagata 
taggtagtcg 
gtcctggtct 
aaaacctggg 
gaaggtggta 



acctcgccct 
aaagccgccc 
gggaagtacc 
^cggggatta 
gtttcggtgg 
gtttattcgg 
actccacatg 
cgccgggtgt 
aaactagtcc 
cgtgcgccga 
agacctgttt 
cagagtggga 
acgcagcgct 
cttgctatcc 
cgccggtgca 
ccacgccgtg 
gcaagcaagc 
gttgctcttg 
cccatctacg 
tagcgatgct 
gtgacggtgt 
cttccatcat 
cctcgtccgt 
gc aagggccc 
ccggactggg 
ttgttgatgg 
tccgcatttg 
tcaggtacaa 
ggagccgcgg 
ctttcgctaa 
cgacagtgac 
gcaaggtagt 
cggtacttaa 
gaacgctcta 
agaccctgtt 
gccacggagc 
tgcagctcct 
caggtgtagc 
ttgagcgtgg 
gagaccaggt 
cgaaaggtgt 
cgctgcatgc 
tcctccaagt 
gcgggtaggc 
cggcgcacga 
gggtaggcaa 
ccctcaagtt 
gcgtttgccg 
agcgaggcga 
ctgcggttgg 
gcaagcacct 
gtgccgtttt 
aggctgacat 
atggcgcact 
ccatgccttt 
tgctttttat 
tcctgctcga 
ggcgcgttgg 



ctgatttfcca 
gcaagctgcg 
acagtggtgg 
aggttattac 
gcgcggattc 
gttgagtagt 
tagggcgtgg 
ggccgctgga 
ttaagagtca 
agctgatctt 
tttattttca 
aaaatcctat 
Gctcctcctg 
ggtcgccttt 
gtaggggctg 
atggtagaga 
aactatggcg 
ggctaacggc 
gttgggtcgg 
gggaacggtt 
ttcgtctgac 
ctcctcctcc 
gggtggcggc 
gcggctgctg 
ggtccaggta 
cttgcaattg 
gcgggcggga 
attfcgcgaag 
acttttcgtc 
gcagttgcga 
actccagtag 
tggctagctg 
tgggcacaaa 
ggataaagtt 
gcagggtttt 
gctcgttgaa 
ttaggttgcg 
acagaaataa 
aatgaagcac 
tgcagagctc 
agtgcaacgt 
actcaagctc 
cggcaggctc 
cctcctcggt 
tcagctcgct 

agfcgggtgac 
ccaccagcac 
cgcgtttctc 
tatcaggtat 
cacggcagga 
ctggcacggc 
cttggcgttt 
ccgctatggc 
ggcgctgcag 
ggtccccccg 
Gctctgttgg 
taatcacttc 
cggcatcggt 



4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
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ggaggcggtg gtggcgaact caaagggggc 
catgatcttt ttctgcctat aggagaagga 
aaccaccccc gagcgcggac gcggtgcggc 
gtccccgtcg ccgtcgccgc cgcctccccg 
tctcgagtcc gaggacgaag aagactcgtc 
cccgcggcca tcgacctcga cggcggattt 
cccctctccc aagcccgagc gcccgccatc 
aagagaagat gtggcgctac aaatggtggg 
cggcaaggga ggtaagcgca cggtgcggcg 
tatgcggacg caagaggaaa aggaagagtc 
aaacccgctg agcctgccga tcgtgtctgc 
gttgatggac aagtaccacg tggataacga 
ccaagtggaa gctctggcgg ccgtatgcaa 
gcagctgacc ttcaccagca acaagacctt 
gtacctgcag tcgtttgcag aggtaaccta 
gtggctgcac cgctgcgctg agatcgaagg 
gataaataag gagcacgtga ttgaaatgga 
gaaggagcag tctagcaagg ccaagatcgt 
gatctccaac accgacgcaa ggtgctgcgt 
ttccggcaag tcttgcggca tgttcttctc 
gcagatcaag gctttcatgc aggcgctgta 
tctgatgcca ctacggtgcg agtgcaactc 
gcagctacca aagttgactc cgttcgccct 
gatctccgac aagagcgtgc tggccagcgt 
ctgcaaccct gtgtatcgca actcgcgcgc 
gatatcggcg cccgacctgc taaacgcgtt 
cttcaccgag ctgccgcgga tggttgtgcc 
tcgcaacgtg tccctgccag tggcgcatag 
aacggcgcag acggcaaggg tggggggtaa 
atttgccttt attgaaagtg tctcctagta 
aaagaagtgg cgctcctaat ctgcgcactg 
aggaagctgt agagctgttc ctggttgcga 
agcatggagt tgggtacccc ggtaataagg 
tggggccagt tggcaaaggc gtggagaaac 
ttgggcccct gcacgctttg ggtggacttt 
gcaatggcgc tacggcgcag gagtgactcg 
tggtcagaaa agccaaaggg ctcaaagagg 
aaggccatcc agtgtacgcc cccagtctcg 
agcttgtgtg gagaaacaaa gcctggaaag 
ccacaaccaa gatctttgac aatggctttc 
gctgttgttg atgttgcttg cttcttttat 
cgcaggtaca cggtctcgat gacgccgcgg 
acttcaaaca aaacataaag aagggtgggc 
tctagcgcgt gggcggagtt ggcgtagaga 
atggacataa agttactgga gaatgggatg 
ttctgggtaa tactgtcaac cgcggttttg 
taagcctgtc cctcgcgcat ggtgggagcg 
tggtgaagaa ttccaacctg ctgatactcc 
ctcatgggct ggaagtttct gaagaacgag 
tagaagccct ggtagccaat attgtagttg 
gtcatgttgc actgagctac gttgtagccc 
tcattgggag taagcaggcg gtcgttgccc 
gccaccttct taaaggtgtg attaagatag 
gagtaggtgt agtaagggtc gtagcctgat 
cgtgtgaagg cccaaccgcg aaatgctgcc 
ttggtggcgt tggcgggtat ggggtatagc 
gactggtcgt tggtgtcatt tctgagcatg 
atggggaaga aggtggcgta aagacaaatg 



ggttaggctg tcctccttct cgactgactc 7 62 0 
aatggccagt cgggaagagg agcagcgcga 7680 
gcgacgtcca ccaaccatgg aggacgtgtc 7740 
cgcgccccca aaaaagcggc tgaggcggcg 7 800 
acaagatgcg ctggtgccgc gcacacccag 7860 
ggccattgcg tccaaaaaga aaaagaagcg 7920 
cccagaggtg atcgtggaca gcgaggaaga 7980 
tttcagcaac ccaccggtgc taatcaagca 8040 
gctgaatgaa gacgacccag tggcgcgggg 8100 
cagtgaagcg gaaagtgaaa gcacggtgat 8160 
gtgggagaag ggcatggagg ctgcgcgcgc 822 0 
tctaaaggca aa.cttca.a.gc tactgcctga 8280 
gacctggcta aacgaggagc accgcgggtt 83 40 
tgtgacgatg atggggcgat tcctgcaggc 840 0 
caagcaccac gagcccacgg gctgcgcgtt 8460 
cgagcttaag tgtctacacg ggagcattat 8520 
tgtgacgagc gaaaacgggc agcgcgcgct 8580 
gaagaaccgg tggggccgaa atgtggtgca 8640 
gcatgacgcg gcctgtccgg ccaatcagtt 8700 
tgaaggcgca aaggctcagg tggcttttaa 8760 
tcctaacgcc cagaccgggc acggtcacct 8820 
aaagcctggg catgcaccct ttttgggaag 8880 
gagcaacgcg gaggacctgg acgcggatct 8940 
gcaccacccg gcgctgatag tgttccagtg 9000 
gcagggcgga ggccccaact gcgacttcaa 9060 
ggtgatggtg cgcagcctgt ggagtgaaaa 912 0 
tgagtttaag tggagcacta aacaccagta 9180 
cgatgcgcgg cagaacccct ttgattttta 9240 
ataatcaccc gagagtgtac aaataaaaac 93 00 
cattattttt acatgttttt caagtgacaa 93 60 
tggctgcgga agtagggcga gtggcgctcc 942 0 
cgcagggtgg gctgtacctg gggactgtta 9480 
ttcatggtgg ggttgtgatc catgggagtt 9540 
atgcagcaga atagtccaca ggcggccgag 9 6 00 
tccagcgtta tacagcggtc gggggaagaa 9660 
tactcaaact ggtaaacctg cttgagtcgt 9720 
tagcatgttt ttgagcgcgg gttccaggca 9780 
cgaccggccg tattgactat ggcgcaggcg 9840 
cgcttgtcat aggtgcccaa aaaatatggc 99 00 
agttcctgct cactggagcc catggcggca 99 60 
gttgtggcgt tgccggccga. gacLgggcgtg 1002 0 
tgcggctggt gcacacggac cacgtcaaag 10 080 
tcgtccatgg gatccacctc aaaagtcatg 10140 
aggttttggc ccaggtctgt gagtgcgccc 10200 
cgccaaaggg tgcgatcgca aagaaacttt 10260 
cctattagtg ggtagggcac gttggcgggg 10320 
aggtagccta cgaatcctga gttgttatgc 103 80 
ttgtatttag tatcgtcaac cacttgccgg 10440 
tacatgcggt ccttgtagct ttctggaatg 10500 
gccaacatct gcaccaggaa ccagtccttg 10560 
tccccgtcaa ctgagcgttt aatctcaaac 10 62 0 
ggccagctaa cagaagagtc aaaggtaatg 10680 
aaggttccgt caaggtatgg tatggagcca 10740 
cccagggaag gggtttcctt tgtcttcaag 10800 
cagttgcgcg atgggatgga gatgggcacg 10860 
atgttggcgg cggaaaggta gtcattaaag 10920 
gcttccagcg tggaggccgt gttgtgggcc 109 80 
ctgtcaaact taatgctagc cccgtcaact 11040 
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ctaagatcgt 
tcatatgtat 
acctgaatgt 
cccgcattgc 
gccccaaggt 
tcgtaggtgt 
ggcaggtaca 
atttccatgg 
tttgtccatg 
taggtgtcag 
gttccatggt 
atagaaaaat 
gtgtttctgt 
cccatgttgc 
taattgggtc 
tttccaggtt 
aaaaccactt 
aagaattgca 
tgaccaccaa 
gtttttttaa 
ccaatttgag 
gcattgtctg 
tgggcataga 
tcttcttcat 
gtttgttccc 
ccggagtagg 
gaagccatat 
tacgcggtat 
gtggttacgt 
tacgtctcgg 
tggcccgaga 
agcgggcgca 
tacgacacgt 
acgatgctgt 
ccgctggcgg 
cctggcagca 
gtgtgctggc 
tgctttagtg 
ctgtcgggcc 
ggaggctcct 
tggttggcca 
tggaaatttt 
ttgctgccat 
cagctgaagg 
gggcgcgagc 
agttacagga 
acttgttttt 
gccaatcgcc 
gtgcgacgcg 
ggccatgccc 
gcaggtgagg 
gtagcgcgct 
aatcggggcc 
gcgtcgggta 
cgcgatggaa 
ggaacggcgc 
cgtagaggtt 
ggcaaccggg 



ttcccagaga 
atgagcctgg 
gaaagggcac 
ggtggtggtt 
taatgtagca 
tggggttgtc 
gcgcaatatt 
caaagttgtt 
tagtatctcc 
tbaccccaat 
tttcaatgat 
atctggttct 
cttgcaaatc 
cagtgctgtt 
tgtttggcat 
tgtaagacag 
ttggtttagt 
agtcaacctt 
aaggatttgt 
gcactctccc 
gttctggttg 
atcctatttg 
catgtgtttt 
cttcatcttc 
actcacagga 
gcttaaaagt 
caagcacacg 
cctcgcggtc 
cgtgcgtagg 
tggcgcgggc 
tgtgcatgta 
cggcggctca 
bagctattta 
tcagtgtgct 
cacggcgcag 
caggtttctg 
ccagcactcc 
cctcctcgta 
gcggacgctt 
ctacgggagg 
ggtccaccac 
gctctttcaa 
agttcttaat 
cgccaccgct 
cgtgtcgcgg 
ccaagcgagc 
aatcagtgtc 
gcggcgatca 
cgccgccgcc 
ctcctacggt 
gccatatctg 
gttggcagca 
agcacgggct 
gtcgctcgtc 
ggatgtaggg 
gggcgccggg 
tt ggacgcgg 
gacgtttgtg 



gctctgcaga 
caggaggagg 
gtagcggccg 
aaagggatta 
gtctacaagc 
agatatttcc 
ggagtaaagg 
acccactcct 
attatcgcct 
acccccaaga 
tctaacatct 
atcacctatg 
taccacggca 
ataatacatt 
agattgttga 
atgtgtgtct 
agcattgcct 
tggaagaggc 
aggcctggca 
tcctgccgca 
ataggaagga 
tagcccgctt 
cttagtagcc 
ctcttcttca 
gttaggagcg 
aggccccctg 
gttgtcaccc 
cacagggatg 
tgccaccgtg 
aaactgcacc 
agaccactgc 
gcagctcctc 
gaagcatcgt 
ttgccagttg 
ggacgcgcgg 
ctgggtgtca 
ggtagccatg 
cgagggaggc 
ttcgccacgc 
gcggggatca 
cccgctaatg 
cttgtccctc 
ggtggaaccg 
catattgctg 
ggccagagac 
gtgagactcc 
tctgcgcctg 
gtggaataag 
ggtggtgcgc 
gcattcttcc 
caagaaccac 
ccagggtcct 
ggcgacggcg 
ttctggggcg 
catattcggg 
gggctgaaac 
ccgcagcggc 
tctccatgcc 



accatgttaa 
aggtttttaa 
tttcccaaca 
acgttgtcca 
ccgggagcca 
acattggtgg 
aaatttctcc 
atttcattac 
gagccattgc 
ggaaaacaat 
ggatcatagc 
gaatcaagca 
tttagctgcg 
aggccaataa 
cccaacatag 

ggggtttcca 

tgccggtcgt 
accccttttt 
taagatccat 
ttagcatcag 
tctgcgtata 
tttgtaattg 
tgatctcgag 
tcctcggcaa 
cGcttgggag 
tccagcacgc 
acagccaggg 
aaccgcagcg 
gggtttctaa 
agcccggggc 
ggcatcatcg 
tggcggcgac 
cggcgcttca 
ccactggcta 
ctagggcggg 
gcggggggag 
ggcgcgatgg 
tcatctattt 
ccctctggag 
agcttactgt 
ccagaggcca 
agcatctggc 
aaatttttaa 
gtgccgatat 
gcaaagttga 
agacttttta 
caaggccacg 
ga.ggggca.gg 
acgacgcatg 
tcggaatccc 
aaagaccggc 
gcctccttcg 
acggcggcgg 
gtaggtgtag 
cagtagtgcg 
gcgaaacatc 
cgcctgcacc 
ctctgtggca 



catccttcct 
tggcaaaaaa 
acatggagcg 
tgtagtccag 
ccactcgctt 
ggttgtattt 
ataggttggc 
gtgttgcaaa 
cattagcctt 
aatttggcaa 
tgtctacagc 
agagttgata 
atgcctgacc 
aattgtccct 
ctttagaatt 
tatttacatc 
tcaaagaggt 
catccggaac 
agcatggttt 
cttcgttcca 
caggtttagc 
tttctccaga 
cgttttgctc 
ctgcccggcc 
ctagagcgtt 
cgcggatgtc 
tgaaccgcgc 
tcaaacgctg 
acttgttatt 
tcaggtactc 
aaggggtagc 
atggacgcat 
gggattgcac 
cgggccgcat 
ttacaacaac 
gcaggtccag 
gacgggtggt 
gcgtcaccag 
acactgtctc 
taatcttatt 
ggccatctac 
ctgtgctgct 
tgccgctcca 
cttgccagtt 
tgtcttccat 
ttttgatttt 
gatgcaattc 
ataccgccgc 
ccgcccgtca 
ggcaccggga 
ttttaaacga 
cgagccaccc 
cgggttccag 
ccacgatagc 
ctggcggtgc 
cacgggtccg 
gcggcatctg 
gtggcaatac 



gaagttccat 
cttttggggc 
ataacggagg 
agaccagcgc 
gttcatgtag 
tagcttgtct 
atttaggtta 
agtttcatct 
aatagcttga 
ttcatcctca 
ctgattccac 
ggacagctct 
agcaagaaca 
gaaagcaatg 
ttcatcacct 
ttcactgtac 
agtatttgag 
cagaacggat 
catgggagtt 
ctgagattcg 
ttgtgtttct 
caaaggagcG 
ttcttcttcc 
gctatcttcg 
gtaggcagtg 
aaagtacgtg 
tttgtacgag 
ggaccggtct 
caggctgaag 
cgaggcgtcc 
catcttggaa 
acatgacaca 
ccccagaccc 
cgatcgcgga 
ggcggacggc 
cgttacaggt 
gggcaggcct 
agtttcttcc 
cacggccggt 
ttgcactgcc 
caccttttgt 
gttccaggcc 
cagcgagccc 
tcccatgaac 
tctacaaaat 
tccacatgca 
cgggcacggc 
gcatgcgacg 
ggccgtggcc 
aacggaggcg 
tgctggggtg 
tgcgcacgga 
tggtggttcg 
cgggggtagg 
cgtacttcct 
tttgcacctc 
ccaccgccga 
tagtgctact 



11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
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ggtggtgggt 
tggccgcacg 
ttccagtcgc 
gcgctttagg 
cacgggcttt 
atgccgcttt 
gtcgccgtac 
aaaaactacg 
ccgccgcctg 
atcatctttc 
cttcttcggg 
gcgcttggac 
taagtagttt 
gccgctaacc 
aatggccgcg 
acccgcggcc 
gttaagcagg 
tgctacgcgc 
ctgaatggcc 
ctccaccacc 
cgcgccccag 
ctttgccccg 
cgatataagg 
tatgcccagg 
tcgctggact 
aacgttttca 
gggaaagcga 
ggcctggtcg 
aaagttgctg 
cgaccagtag 
gcggatcccg 
gaaggtgctg 
tttgatcacc 
ttctgccgct 
gcgaatggca 
gttctcttcc 
ggtgtcatct 
accctccagg 
gcccaacaag 
gatgtctgga 
gtcaaacccc 
cccgttctgc 
ggtcatggtc 
ctccacctgt 
ggtaaacatg 
gcGCcagtgc 
gtcattgttt 
tgccacatcc 
caactcagag 
aaacggcggg 
gctcaccacg 
actaagggga 
tgcatcatgc 
caaacgggtt 
cctcccacac 
cgggcacatc 
tctctatccc 
gcgagttgtt 



atctgaacgt 
cggacctcgg 
tggcgcttgg 
ccgcgctttt 
aggctaggtg 
ccgtaggcaa 
acctcatcat 
gtggtgccgg 
ggcgcggttg 
tttttctttt 
gggccataga 
atagcttcgt 
tttcttgcaa 
gagtcgcgca 
gctgctgcgg 
agccttcgag 
gccgccgccg 
ctccgccgtc 
gcgtccactg 
gcgtcaatgg 
ggcgcgcggt 
ccaaacatct 
atggacatgt 
gccttgtaaa 
cctccgatgc 
ctgacggtgg 
ttgaacacgt 
ttgtagaagc 
atctggctcg 
acctgctccg 
gtctgagggt 
tcattgctta 
ggtttcttct 
gcctcggcct 
tgatcgttca 
gcgccgctgc 
tttaagcttg 
tcatcgtagg 
ttgctcaggc 
tggaaggctt 
agtctgaagt 
ctgcccactt 
tcggagtagt 
ttgtccttag 
aactcgtfcca 
gaacggtcgt 
tgaatgaccg 
gttgacttgt 
taacggatgc 
tccaggggag 
ctctcgtagg 
atacaagaaa 
tttttttttt 
gccactccct 
cagaccccgc 
cctgtgctcc 
gttgttggga 
taggtactcc 



ccacggtctg 
gctcca.gccc 
gtaccatcag 
cttcggacgg 
ttgggttgcc 
actccccgag 
acacgcgctt 
gtcgcaaaac 
cgtgcagcag 
tgacccgctt 
tctccggcgc 
tgcgcgccgc 
tctagttgcg 
cccagtacac 
cggccgctcg 
cggcccgcat 
cgcgttgggc 
tcttcatttt 
tggacactgg 
cgtcatcgac 
agtgcccgcg 
tgcttgggaa 
ttgctcaaaa 
cgtaggggca 
tgttgcgcag 
tgatggtggg 
gggtcagaga 
tcttggagtg 
tggagcggaa 
agccgcaggt 
cgccgtagtt 
ttaggttgta 
gaggcttctc 
cagcgcgctt 
tgtcctccac 
cactgttgtt 
cctggtaggc 
tgatcctaaa 
ggctgtgggt 
cgtttgtata 
tgcgggtgtc 
tcaagtagtg 
tgccctcggg 
taagcgagcg 
catttggcat 
caagattgat 
tggttagaaa 
tgtccacaag 
tgtttctccc 
catcgaaggg 
agggaggagg 
accaacgctc 
tttttttaaa 
cccaaatcca 
tgacggtcgt 
tgcgcatacg 
aatgccggag 
tcctcgccca 



cacgcccagt 
aggctccacg 
ctgcacggtg 
tgcaagcgtg 
ctcgtccagc 
gcgctcgttg 
gtaggtgcgg 
acgtcttacg 
ttccacctcg 
tagctttcgg 
gatgacctgg 
cgccgctgga 
cggggggcgg 
gttgcccctg 
tcgcctggac 
ggccgcccgt 
ggcagtgccg 
agcataacgc 
tggcggcgtg 
ggtggtgcgc 
cacgcgcact 
gcgcaggccc 
agtgcggctc 
ggtgcggcgt 
cggtagcgtc 
ggctggcggg 
ggtaaactgg 
cacgggcaac 
ggtcacgggg 
tacgtcagga 
gtatgcaagg 
actgcgtttc 
gacctcgggt 
ctcctccgcc 
cggctgcatt 
gccgccgcct 
gtccacatcc 
gccctcctgg 
gaagtccacc 
taccGcaggc 
aaactttacc 
ctccacgatc 
cagcgtgaac 
cgacaccatc 
gttggtatgc 
ggtctgtgtg 
gttgctgtgg 
gtacacacgg 
cccggtaggc 
ggaacccagc 
accttcctca 
ggtgccatgg 
acattctccc 
ggacgctgct 
gcctttgacg 
tcttccatct 
gcaggttctt 
gcaggcgcgg 



cccggtgcca 
gtcatttttt 
ggtgccaagt 
ggcagcacct 
ggcaacgcca 
gcctgctcaa 
gtggagcgct 
cgtcgacctt 
tcgtcaagtt 
ggcttgtaat 
agcatctctt 
tacatacaac 
gtgcgcacgg 
cgaccctgag 
ctggggggca 
cggccggtgc 
ggtcggcggc 
cgggctccgc 
ggcgtgtagt 
ccagtgcggc 
gggtgttggt 
cagcctgtgt 
gataggacgc 
ctggcgtcag 
ccgtgatctg 
cgcgccaaaa 
cggatgagct 
agctcggcgc 
tcttgcatca 
gtgcaaagga 
taccagctgc 
ttgctgtcct 
tgcgcagcgg 
cgtgtggcaa 
gccgcggctg 
gcgccatccc 
aacagtgcgg 
aa-gggttgcc 
ccgcatcctg 
atgacaagac 
ccgatgtcgc 
gcgttgttca 
tccacccact 
acccgcgcct 
aggatggttt 
cttgcctccc 
tcgttctggt 
gtggtgtcga 
cgcaggtacc 
gccgccgcca 
tacatcgccg 
ccttggtgag 
cagcctgggg 
gtcgtctgcc 
acgggtgggc 
actcatcttg 
ttcgcgctgc 
gcgggtggtg 



cctgcttgat 
ccaagacatc 
caccagactc 
gctgcagtgt 
acatgtcctt 
gcaggtcctc 
caccgggcgt 
tccactgtac 
catcatcatc 
cctgctcttc 
ctttgatttt 
agtacgagtc 
gcacgcgcag 
tcatagcact 
cagtgacaat 
gacgtgcgcg 
ggtggcgacg 
gcaccacggt 
tgcgcgcctc 
cgcgtttgtg 
cggagcgctt 
tattgctggg 
gcggcgagac 
taatggtcac 
tgagagcagg 
tctggttctc 
gggagtagac 
ccaccaccgg 
tgtctggcaa 
gggtccatga 
ggtactgggt 
ctgtcagggg 
gggcggcagc 
aggtgtcgcc 
ccgcgttgga 
cgccctgttc 
gaatgttacc 
gcttgcggat 
gcagcaaaat 
cagtgactgg 
tttccagaac 
taaggtctat 
catatttcag 
taaacttatt 
tcaggtcgcc 
ccgggctgta 
agttcaggga 
ataggggtgc 
gcggaggcac 
ctggcgccgc 
cgcgctgcat 
ttttttattt 
cgaaggtgcg 
gagtcatcgt 
gggcgcgggc 
tccactaggc 
ggctgcagca 
cgagtgctgg 



14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
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taaaagaccc 
ctagatcgga 
gctcgcgctg 
tgtcgtccca 
cagggggcgg 
cggccgcgcg 
tgcccggttc 
gcatcaggtt 
ttaaactgtg 
cctccaggct 
aagtatgctc 
, ctaggtatgt 
acaggcgcga 
gcaccGccca 
cttgtctact 
tctcagtcac 
gcacgttggc 
cctgctgctg 
ccaggttggt 
agaccaggcc 
ccagcacctt 
agagccggtt 
tgatgcggtc 
gcagcgccgc 
cgcttagtac 
gccgacgttg 
cagggcgcgt 
ctctctatcg 
gtcgctgagc 
aacgcccagg 
gatctttacc 
cttgcccaga 
gcgctcctgc 
gggctctact 
tataaggaac 
aaagtcccac 
gtggttgtta 
taccaggtcg 
ctcaaagtcc 
tgcgtggaac 
gttctgccgc 
ctcaggagag 
ccgggcccgg 
tgcccctcct 
ctcttgccgc 
gcaaaaaagg 
ggaggcaaac 
gactcaaccc 
tttccagcct 
cgccgcgcct 
ggttcgacac 
accccggtcg 
cgcttacagg 
ccagagcgtc 
cgactttccg 
cggatatcat 
gcccctctac 
ttgcaacgac 



tatcaagctt 
caagctgctt 
ttgcaactct 
gaggaatcca 
tagccagtgc 
atgcaagtag 
catgttgcgc 
aaggctcacg 
caacgagggg 
gcccgtgtcc 
gtccacatgc 
gtcccgggac 
gttgggcgtc 
cagcccctgc 
ggtctggaaa 
cattagccgc 
gggctgtgta 
cgcgcgctca 
ctgcacgttg 
ggcctcatcg 
ctcgtgcgtg 
ggcctgcggc 
catgacctgg 
ccgcaccgcc 
tcgccgtcct 
ccagcgcgcg 
cggcttgggg 
ccgctgcccg 
tcgcgccgcc 
tcgtcgctca 
tccttgtcta 
ctgagcatgg 
accactatgc 
atgtttagca 
agctgcgcca 
agatgcatca 
aagctttttt 
gcggccgcca 
tcctcgcgca 
tttcgatccc 
ggcacgtacg 
ggcgctccta 
cgccgcgggg 
gacgcggtag 
tgctgaggag 
ggctcgtccc 
ccccgttcgc 
ttggaaaata 
aaccgcttac 
ggaaggaagc 
gcgggcggta 
tccgccatga 
ctctcctttt 
ccgaccatgg 
cgcgcctcca 
cgccttatgt 
cgccagccgc 
tacgtctttg 



ggaaatgggc 
ggcctgcgga 
agcagggtct 
tcgttaccct 
gggttcaaga 
tccattaggc 
gcggtcatgt 
ctctgctgca 
atcttctgcc 
tcctgcccca 
gcctgaccta 
acgctgccac 
agcaagctag 
aagttcttga 
aaatagtctg 
agtgcgctca 
ctcaggaagg 
cgctgcgcca 
ccgctgttgt 
ggccggatgg 
gggtttgcgc 
tgctgccgga 
cgccagtcgt 
gggtccgttg 
ctggctcgta 
cgggtgccac 
cccagcgcag 
tgccagccag 
ggctcacgct 
aggtaagcac 
tgggaacgta 
aatagttaat 
tctgcagaat 
gcgcatccct 
tgagcggctt 
gtcctatagc 
gaaagttaat 
cgtgtgcgcg 
gcaaccgctc 
gcatctcctc 
cctcgcgcgt 
gccgcgccag 
gttcgtaatc 
gagaagggga 
gggggcgcat 

tgtttccgga 
cgcagtccgg 
accctccggc 
gccgcgcgcg 
caaaaggagc 
accgcatgga 
tacccttgcg 
gcacggtcta 
agcacttttt 
ccaccgccgc 
tggaagacct 
cgccgcactt 
actcaagggc 



tactcgcatc 
agctttcctt 
gcggttgcgg 
cgggcacctc 
tggcattggt 
gattgataaa 
ccagcgccac 
catagcgcaa 
gccggttggt 
gcgcgcggct 
tggcctcgcg 
tgtccgtgaa 
acacggtcgc 
aagcctggct 
gcccggactg 
caaagttggt 
cgtttagtgc 
cggcctcgcg 
aacgagccac 
ccctgttttc 
gcgccgggac 
acgcgtcagg 
ccgtggagtt 
cgtcttgcat 
ctcatcgtcc 
cgccagccca 
gtcagcgccc 
ggccctttgc 
cacggccttg 
cttcaacgcc 
aggggtatgg 
ggcggccacc 
gtttatcaaa 
gaatgcctcg 
gctatttggg 
cacctcctcg 
ctcctggttc 
cgcgggacta 
gcggttcagg 
gggctcctct 
gtcacgcttc 
gccctcgccc 
accatctgcc 
gggtgccctg 
ctgccgcagc 
ggaattfcgca 
ccggcccgag 
tacagggagc 
gccagtggcc 
gctcccccgt 
tcacggcgga 
aatttatcca 
gagcgtcaac 
gccgctgcgc 
cggcatcacc 
cgcccccgga 
tttggtggga 
ttactcgcgt 



tgaccgcggg 
tcgcagcgcc 
ggaaaacacg 
aaatcccccg 
gaaatactcg 
cggccggttt 
gctgggcgtt 
gatgcgctcc 
cagcaggtag 
gacacttgta 
gtacagtgtc 
gggcgctatt 
gcggtcgcct 
caggtttacg 
gtacacctca 
gtagtcctcc 
aaccatggag 
cacatccccc 
gcgctgaagc 
ggccagcgcg 
caccgcttcc 
gttacgcgca 
aaggccggac 
catctgatca 
tcgtcatatt 
ggtccggccc 
gcgtcaaagt 
aggctgtgca 
tggatgcgct 
atgcgcatgt 
tatatcttgc 
ttgtcagcca 
tcgagcagcc 
ttgtccctgc 
ttttgctcca 
cgcgccacaa 
accgtctgct 
atcccggtcc 
ccatgccgca 
ccctcgcggt 
agctgcaccc 
tcctccaagt 
gccgcgtcag 
catgtctgcc 
accggatgca 
agcggggtct 
actcgaaccg 
gagccactta 
aaaaaagcta 
tgtctgacgt 
cggccggatc 
ccagaccacg 
gactgcgcac 
aacatctgga 
tggatgtcca 
gccccggcca 
tatcagtacc 
ctcaggtaca 



gccgcagcgc 
gcctctgcct 
ctgtcgtcta 
gtgtagaaac 
gggttcacgg 
gaggcataca 
accccgtcgc 
tcctcgctgt 
ttcagggttg 
atctcctgga 
agcaagtgac 
agcagcagca 
gtgggagccc 
gtctgcaggc 
ctttgcggtg 
tgtccccgcg 
cccaggttgc 
accagccggt 
agcgcgtcgt 
tttacgatcg 
agaattgcgg 
gtcagcgaca 
ggctggctct 
gaaacatcac 
cctccacgcc 
cagctgcctc 
aggactcggc 
tcagctcgcg 
cgttgcgata 
agaacccctc 
gggcgtaaaa 
ggctcaagct 
agcggccctc 
tgtgctgcac 
gcgcgcttac 
gcgtgcgcac 
cgtacgcggt 
gcgcgtcggg 
actcgcgccc 
cgcgaaacag 
ttgggtgtcg 
ccaggtagtg 
ccgcggatgt 
gctgctcttg 
tctgggaaaa 
tgcatgacgg 
ggggtcctgc 
atgctttcgc 
gcgcagcagc 
cgcacacctg 
cggggttcga 
gaagagtgcc 
gcctcaccgg 
accgcgtccg 
ggtacatcta 
ccctacgctg 
tggtgcggac 
ccgagctctc 



18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
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gcagccgggt 
cacgggcgca 
gcagcaggcc 
gggcttcggg 
cgccgtagcg 
agaaaggctc 
catggccgac 
catccgccgt 
caaccccgac 
ctggttagac 
cctcggtggc 
gccgcacggc 
gcgcccccgc 
gatcgagcgc 
cccaccgccg 
agaagaggcc 
tcttctggag 
ggacttctac 
gcgacgctgg 
ctttcagcgc 
ggtggtcatg 
gggaggcctc 
cgtggagcga 
aatcgcctat 
cgacaccgaa 
cttcacgcag 
cctccgcgcg 
cccggcgggt 
gcatcatcca 
cggggatgcg 
tatcacctgg 
ccgcctgacc 
gtcatcaagc 
gtccgatcca 
gaagaccacc 
cgcggtcgtc 
gcacgttttt 
cacatcaaca 
catcctcgca 
gcctttggga 
ctggtgaccg 
ccgcttacct 
tttcgcgacc 
aaccctcatc 
acctacgagg 
tacatcgtgg 
aacaaccgtt 
ggaaagatac 
acggactttt 
ctcaaagtca 
caggcatacg 
tacatgctag 
gaccgcgaag 
gacatcatca 
gtcaacaagc 
gccagcttca 
cagatagtct 
ccctcgcacg 



caccagaccg 
taccaccgct 
atattagccg 
gtcacacgca 
atagatgcaa 
atgcaagact 
cgcctgcgca 
ctcaagaccg 
cgccacccgc 
gcctttctcg 
ggagtaccta 
agccccccgc 
gagaacggcc 
tttgtcgacc 
ccagaagaag 
Gctgtagcct 
gaggagttaa 
gaggccatgg 
gttatgtact 
ctgcgaaact 
cgcgcccgcg 
aacgccttct 
gccggacgcg 
caagacaacfc 
attgattctg 
ssfcrcgccaga 
cagcaccagc 
cccgagcccG 
aggacacccc 
gcctcttgca 
acctagcccg 
tgtcgccgcg 
tccgaccgcc 
ccataactcc 
agccaaaccc 
aggtcaacct 
acaaaagcca 
gccactcctc 
ccgagcgtct 
agcagctcgt 
ccgcgcgaga 
tctactgcat 
acctgcaaat 
ttgcagactg 
aacttaaaaa 
gccacaacat 
ccgaggtgcc 
ttttcaacga 
tgctctggga 
tggttaggga 
cgctacccgt 
gctcttaccg 
agtttgtcct 
aggaaaccct 
tgcgcgactc 
acgtcttcca 
tccgagcaga 
aactatacga 



ttaactggtc 
ttgtggacat 
agcgcgttgt 
tgggaggaag 
gagatgcagg 
actacaaaga 
ttcagcaggc 
cctactttaa 
tgccgcccgc 
agaggttttc 
cacaacaatt 
caacccataa 
gcgccgtcac 
gcctcccggt 
aagaagaagg 
ttgagcgcga 
ccgtgtcggc 
agcgccttga 
tcttcgtggc 
acgccgtctt 
atgccgaagg 
cgcagctcat 
gagatctcca 
caggagacgt 
tcgaactctc 
ttcaggagat 
tcctgcccgc 
ccctacctcc 
cgcggcccac 
agtcatcgac 
cgccctgacc 
ggagctccag 
gcgcgcggac 
cctcgagcag 
gccaggcgag 
cgtgcaggat 
gcacgagtgt 
caactggtgg 
ctttgtcacc 
gcccttcatg 
cctagccgtg 
caccccagaa 
gctaatggcc 
ggccctgtca 
attgccctcc 
caacggcttc 
gggacccttc 
tgtcaccttc 
gcagggcgga 
cacctttgcg 
agaaaaggga 
ttcggaggcc 
caaccgcgag 
ggactactgc 
ctacgcctcc 
gcgtccaacc 
gcagcccgcc 
ttacgtgcgc 



cgttatggcc 
ggatgacttc 
cgccgacctg 
agggcgccac 
acaagaggaa 
cctgcgccga 
cggacccaag 
ttacatcatc 
cacggtgctc 
cgatccggtc 
gttgagatgc 
ccgggacatg 
cgagaccatg 
gcgccgtcgt 
ggaggccctt 
ggtgcgcgac 
gcgcaactcc 
ggccttgggg 
agaacacacc 
cgcccggcac 
gorgcgtggtc 
ggcccgcatc 
ggaggaagag 
gcaggagatt 
tttcaggttc 
caaccgccgc 
gcgcggcgcc 

gggggcccgc 
cgcccgccgc 
gccgccacca 
cggctatgcg 
accatggaca 
atctggactt 
ccagacggtc 
gggctcaaat 
gtacagcccg 
tcggcccgtc 
cgggagatcc 
tacgatgtag 
ctggttatga 
gaccttggat 
aaaatggcca 
cgtgacctgt 
gaacacgggc 
atcaagggca 
gacgagatcg 
cgcatcacac 
gccctgccaa 
tgcgacgaca 
ctcacccaca 
tgctgcgcct 
gacgggtttc 
ctgtggaaaa 
gccctagacg 
ttcgtgcgtg 
atatcatcca 
cgtagcaacc 
gccagcatcc 



aactgcactt 
cagtctaccc 
gccctgcttc 
ctacggccaa 
ggagaagaag 
tgtcaaaacg 
gacatggtgc 
agcagcacct 
agcctacctt 
gatgcggact 
atcgttagcg 
acgggcggcg 
cgccgtcgcc 
cgccgccgtg 
atggaagagg 
actgtcgccg 
cagtttttca 
gatatcaacg 
gccaccaccc 
gtggagctca 
tacagccgcg 
tccaacgacc 
atcgagcagt 
ttgcgccagg 
aagctcaccg 
gtcgtcgcgt 
gacgtgcccc 
ccgcgtcacc 
gcggtaccgt 
accagcccct 
aggtaaacct 
gctcccatct 
tgggctcgcg 
aaggacaagc 
tcccactctg 
tgcaccgctg 
gcagggactt 
agttcttccc 
agacctatac 
agttcggcgg 
gggaccgctg 
taggtcgcca 
ggagctcatt 
tcagctcGcc 
ccccgcgctt 
tgctcgccgc 
gcaactttat 
acccgcgttc 
ctgacttcaa 
cctcgctccg 
accaggccgt 
cgatccaaga 
aaaagggaca 
tgcaggtcac 
acgcggtagg 
actcacatgc 
tcggtcccga 
gcggtggaag 



acaccatcaa 
tcacgcaggt 
agccgatgag 
actccgccgc 
aagtgccggt 
aagcctgggg 
ttctgtcgac 
ccgccagaaa 
gcgactgtga 
cgctcaggtc 
ccgtatccct 
tcttccaact 
gcggggagat 
tcccccctcc 
agattgaaga 
agctcatccg 
acttcgccgt 
aatccacgtt 
tcaactacct 
atctcgcgca 
tctggaacga 
tcgccgccac 
tcatggccga 
ccgccgtcaa 
ggcccgtcgt 
tcgccagcaa 
fcgccccctct 
gcttttagat 
agtcgcgccg 
ggaaatcagg 
gcaggagctc 
gcgcgatgtt 
cggcgtggtg 
agccgaagta 
cttccttgtg 
ccagfcactgc 
ctactttcac 
gatcggctcg 
ttggatgggg 
agatgagcct 
ggaacaagac 
gtttaggacc 
cgtcgcttcc 
tgaggagctc 
cttggaactt 
ccaggtaatt 
gcctcgcgcg 
caaaaagcgc 
ataccagtac 
gaaggccgcg 
caaccagttc 
gtactggaaa 
ggataagtat 
cgccgagctg 
tctcacagac 
catcttcagg 
cctcctcgct 
atgctaccct 



21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 



-28- 



wo 02/20814 



PCT/USOl/27682 



acatatcttg 
tccgcgctca 
gccgcccgcg 
gcgcgcctgc 
gacccactac 
ctacgcggag 
gtgcacctgg 
gaatacgtgc 
ctgcgctcca 
gacaacaaaa 
gcgggccagg 
gtcatgcccg 
gcggaagaga 
cccggtcacg 
atgtgtcttc 
cacttagcct 
ctatacgagg 
gacacggaca 
aaacgcatca 
tggctcgtgg 
tcggtatttc 
gcctcctcca 
accatggtca 
agcaggacca 
gtgacccaga 
ctggacgagc 
atatgctgga 
ttggtcaaac 
cttccttgca 
aaaacatgca 
gctgcagctc 
gctgcggtaa 
cggaaacggt 
cgtgggaaat 
cgcagtctgg 
aacacaacta 
ccattgccat 
agttcttcca 
ctgtgctggt 
acctaaaaat 
accgctttgt 
acatttttag 
cgcagcatga 
atctgaacat 
accgagaccg 
acacttgctt 
gggaggggag 
aacgacatta 
atcgcgggcg 
agtcacaatc 
caacggcgtt 
gcgaccatca 
cgttgtcaaa 
tgctgcaaaa 
taagccccgc 
ggttgccccg 
gatgcagatc 
cctccacgca 



gaatactcag 
cccaccccat 
catggcagca 
tgcccggggt 
cgccattctg 
aggtagccac 
tgcccgacga 
agctaaacat 
tcgccaagtt 
agattgtctt 
tgaatatcaa 
cttttgagag 
gtgaggacga 
tggcctacac 
acaccctgga 
ccttcgtgct 
aggaccgcgg 
gccfctttcgt 
aaaagcatgg 
aatgcgagac 
tcgcgcccaa 
agggcaagct 
aatgctacct 
gcctcaagcg 
ctacgctgac 
accgactact 
tcgagatgcc 
gctcaaaagc 
agaactgcta 
agtcagggac 
tcttaactac 
gtcgcagctg 
tttcttcatc 
gcaaatctgt 
caccctccgc 
tgacgttagt 
cattatggac 
cgcatttcct 
ggttctgcac 
acagtccaag 
aaacacttac 
gcaccacgcc 
agctctgcag 
ccagagtcac 
ctggtcccgg 
gatcaaaatc 
gaagccttca 
agttcccggg 
gatgaacggg 
ccgcgggcgg 
ccagacacgg 
atgctggagc 
tatgagctca 
cagatacaaa 
ccatcgatgg 
tattcagtgt 
aattaatacg 
cgttgtgata 



agagcccctc 
gccatggggt 
ggcgctagac 
ctttaccgtg 
ttcgcgcaag 
cagcgttgac 
gcgcaccacc 
cgcggccaag 
gctgtccaac 
ttctgaccag 
atcctcctcg 
ggagtactca 
acgcgccccc 
ctataaacca 
gcgagtggac 
ggcctggacg 
aacaccgctc 
caccgagcgt 
gggaaacctg 
cgtctgcggg 
gctctacgcc 
gcgcgccaag 
ggccgacgcg 
caccctggcc 
gaggaccctg 
gccgtacagc 
gtagagcacg 
atgcctacgg 
tcgctgggcg 
atgcttaacg 
cagttgcagc 
ctcaggaacc 
gccccgcagg 
gagggtaact 
ccgcgctttg 
gatcccagaa 
gaatgcatgg 
tctaagctac 
aacatgaatc 
atgcatctca 
accaagggcc 
cagcgctcct 
tggtgctacc 
ctttaccacg 
gcctaccgcg 
caaacagagt 
gggcagaaac 
tcaaagaatc 
aagctgcact 
tggctgcagc 
tctcgtaggt 
ccatcacatt 
caatgcttcc 
actacatgag 
caaacagcta 
cgctgatttg 
atacctgcgt 
tgtagatgat 



tacgtttacg 
cccccactca 
ttgcaaggat 
gacgcagacc 
ggcggccgcc 
cttgtcaccc 
gtctttcccg 
gagcgcgccg 
gccctctacg 
atggacgcgg 
tttttggaaa 
ccccaacagc 
accccctttt 
atcaccttcc 
cccctagtgg 
cgagccttcg 
gaggacaggc 
ggacaccggc 
gtttttgacc 
gcctgcggcg 
cttaaaagtc 
ggccacgccg 
cagggcgaag 
agcgcgcagc 
cgcGcgtgga 
gaaagccgcc 
tgaccgagct 
cggacggcct 
gcgagcgcct 
aagtggcccc 
cggtaatagg 
tgctttcttc 
tagacatgat 
acgcccctgg 
taaaaatggc 
atafccttcgc 
aaaatctcgg 
atgacaaatt 
cccggaggga 
tatcGccacg 
tgcccctggc 
gctacgactg 
tccaccccag 
tcctggaaaa 
cgcgcaaaac 
ctggttttta 
ctgctggcgc 
caattgtgcc 
gcttgcaagc 
ggctgaagcg 
caaggtagta 
ctgacgcacc 
atcaaacgag 
acccccacct 
ttatgggtat 
tattgtctga 
cataattgat 
aatcattatc 



acatttgcgg 

acccatacga 

gcaagataga 

ccccggacga 

tctgctggac 

tgcacaaccg 

aatggcggtg 

atcgcgacaa 

ggtcgtttgc 

ccaccctcaa 

ctgacaatct 

tggccctcgc 

atagcccccc 

ttgatgccga 

acaacgaccg 

tctcagagtg 

ctctcaagtc 

tcatggaaac 

ccgaacggcc 

cggatgccta 

tgcactgccc 

cggaggggct 

accggcagcg 

ccggagcgca 

aagacatgac 

ccaacGcgcg 

gtgggaccgc 

caaaccgttg 

tctggcgcat 

cctgctcagg 

tgtgatttac 

ccagctgatc 

ccccccatct 

gccggatgga 

ctatgacgat 

ccaggccgcc 

aggtcacaag 

tcccaagtgc 

tatggctggg 

tatgcaccca 

aatcagcttg 

gatcatctac 

agacgggctt 

aatacacagg 

ccctaaataa 

tttatgtttt 

agatccaaca 

aaaagagccg 

gggctcagga 

gcggcggagg 

gagtttgcgg, 

ccggcGcatg 

ttggcgctca 

tatatattct 

tatgggtgct 

agttgttttt 

tatttgacgt 

actttacggg 



catgtacgcc 
gcgcgcgctt 
ctacttcgac 
gacgcagcta 
caacgagcgc 
cggttggcgc 
cgttgcgcgc 
aaaccaaacc 
caccaagctt 
aggcatcacc 
tagcgcagaa 
agacagcgat 
ttcaggaaca 
agagggcgac 
ctacccctcc 
gtccgagttt 
tgtatacggg 
cagaggtaag 
agagctcacc 
ctccccggaa 
ctcgtgcggc 
ggactatgac 
cttcagcacc 
ccccttcacc 
cctggcccgt 
aaacgaggag 
ctggaactgc 
aaaaactttg 
ttggtcaggg 
gatgacggca 
gggcccaccg 
tcccctacGC 
gaactcaaag 
accattatac 
ctcatcctgg 
gcccgtgggc 
ggcgtctcca 
accggataca 
aacatagcca 
tcccagctta 
ctactgaaag 
aacaccaccc 
atgcccatgt 
accctcaacg 
agacagcaag 
aaaccgcatt 
gctgctgaga 
tcaacttgtc 
aagcaaagtc 
ctgcagtctc 
gcaggacggg 
ggggcatgcg 
tggcggcggc 
ttcccaccct 
agcgacatga 
acgttaagtt 
ggtttgatgg 
tcctttccgg 



25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
26820 
26880 
26940 
27000 
27060 
27120 
27180 
27240 
27300 
27360 
27420 
27480 
27540 
27600 
27660 
27720 
27780 
27840 
27900 
27960 
28020 
28080 
28140 
28200 
28260 
28320 
28380 
28440 
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tgatccgaca 
ggtttaaggc 
tgaaaagaaa 
tctctgtttt 
ggccagcaaa 
cgcccccctg 
ggactataaa 
accctgccgc 
catagctcac 
gtgcacgaac 
tccaacccgg 
agagcgaggt 
actagaagaa 
gttggtagct 
aagcagcaga 
gggtctgacg 
aaaaggatct 
atatatgagt 
gcgatctgtc 
atacgggagg 
ccggctcctg 
cctgcaactt 
agttcgccag 
cgctcgtcgt 
tgatccccca 
agtaagttgg 
gtcatgccat 
taataccgcg 
gcgaaaactc 
acccaagtga 
aaggcaaaat 
tttccttttt 
atttg 



ggttacgggg 
gtttccgttc 
ggaaacgaca 
tgtccgtgga 
aggccaggaa 
acgagcatca 
gataccaggc 
ttaccggata 
gctgtaggta 
cccccgttca 
taagacacga 
atgtaggcgg 
cagtatttgg 
cttgatccgg 
ttacgcgcag 
ctcagtggaa 
tcacctagat 
aaacttggtc 
tatttcgttc 
gcttaccatc 
atttatcagc 
tatccgcctc 
ttaatagttt 
ttggtatggc 
tgttgtgcaa 
ccgcagtgtt 
ccgtaagatg 
ccacatagca 
tcaaggatct 
tcttctgcat 
gccgcaaaaa 
caatattatt 



cggcgacctc 
ttcttcgtca 
ggtgctgaaa 
atgaacaatg 
ccgtaaaaag 
caaaaatcaa 
gtttccccct 
cctgtccgcc 
tctcagttcg 
gcccgaccgc 
cttatcgcca 
tgctacagag 
tatctgcgct 
caaacaaacc 
aaaaaaagga 
cgaaaactca 
ccttttaaat 
tgacagttac 
atccatagtt 
cggccccagt 
aataaaccag 
catccagtct 
tcgcaacgtt 
ttcattcagc 
aaaagcggtt 
atcactcatg 
cttttctgtg 
gaactttaaa 
taccgctgtt 
cttttacttt 
agggaataag 
gaagcattta 



gcgggttttc 
taacttaatg 
gcgaggcttt 
gaagttaacg 
gccgcgttgc 
cgctcaagtc 
ggaagctccc 
tttctccctt 
gtgtaggtcg 
tgcgccttat 
ctggcagcag 
ttcttgaagt 
ctgccaaagc 
accgctggta 
tctcaagaag 
cgttaaggga 
taaaaatgaa 
caatgcttaa 
gcctgactcc 
gctgcaatga 
ccagccggaa 
attagttgtt 
gttgccattg 
tccggttccc 
agctccttcg 
gttatggcag 
actggtgagt 
agtgctcatc 
gagatccagt 
caccagcgtt 
ggcgacacgg 
tcagggttat 



gctatttatg 
tttttattta 
ttggcctctg 
gatccaggcc 
tggcgttttt 
agaggtggcg 
tcgtgcgctc 
cgggaagcgt 
ttcgctccaa 
ccggtaacta 
ccactggtaa 
ggtggcctaa 
cagttacctt 
gcggtggttt 
atcctttgat 
ttttggtcat 
gttttaaatc 
tcagtgaggc 
ccgtagtgta 
taccgcgtga 
gtgccgagcg 
gccgggaagc 
ctacaggcat 
aacgatcaag 
gtcctccgat 
cactgcataa 
attcaaccaa 
attgggaaac 
tcgatgtaac 
tctgggtgag 
aaatgttgaa 
tgtctcatca 



aaaattttcc 
aaataccctc 
tcgtttcctt 
gcgagcaaaa 
ccataggctc 
aaacccgaca 
tcctgttccg 
ggcgctttct 
gctgggctgt 
fccgtcttgag 
caggattagc 
ctacggctac 
cggaaaaaga 
ttttgtttgc 
cttttctacg 
cagattatca 
aatctaaagt 
acctatctca 
gataactacg 
cccacgctca 
cagaagtggt 
tagagtaagt 
cgtggtgtca 
gcgagttaca 
agttgtcaga 
ttctcttact 
gaatacggga 
gttcttcggg 
ccactcgcgc 
caaaaacagg 
tactcatact 
gcggatacat 



28500 
28560 
28620 
28680 
28740 
28800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
30120 
30180 
30240 
30300 
30360 
30365 



<210> 5 

<211> 33 

<212> DNA 

<213> Homo sapiens 



<400> 5 

gcggaattcg gcttggtgac ttagagaaca gag 

<210> 6 

<211> 33 

<212> DNA 

<213> Homo sapiens 

<400> 6 

gcQ'^^atcct tgaacccgga ccctctcaca eta 

<210> 7 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 



33 



33 



"30- 
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<223> derived from Adenovirus 



<400> 7 

actctcttcc gcatcgctgt 



20 



<210> 8 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Adenovirus 
<400> 8 

cttgcgactg tgactggtta g 21 

<210> 9 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Adenovirus 



<210> 10 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Adenovirus 
<400> 10 

ggtgtccaaa ggttcggaga 20 



<400> 9 

ccgcacccac tatcttcata 



20 
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